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Identification of virulence associated regions RDl and RD5 leading to improve 
vaccine of M bovis BCG and M. microti 



5 Virulence associated regions have been sought for a long time in Mycobacterium. The 
present invention concerns tlie identification of 2 genomic regions which are shown to 
be , associated with a virulent phenotype in Mycobacteria and particularly in M, 
tuberculosis. It concerns also the fragments of said regions. 

10 One of these two regions are known as RD5 as disclosed in Molecular Microbiology 
(1999), vol. 32, pages 643 to 655 (Gordon S.V. et ah). The other region named RD1-2F9 
spans the known region RDl as disclosed in Molecular Microbiology (1999), vol. 32, 
pages 643 to 655 (Gordon S.V. et al.). Both of thie regions RDl and RD5 or at least one 
of them are absent froni the vaccine strains of M. bovis BCG and in M. microti^ strains 

15 found involved and used as live vaccines in the 1960*s. 



Other applications which are encompassed by the present invention are related to the use 
of all or part of the said regions to detect virulent strains of Mycobacterio. and 
particularly M tuberculosis in humans and animals. The region RD1-2F9 and RD5 are 
20 considered as virulence markers under the present invention. 
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The recombinant Mycobacteria and particularly M bovis BCG after modification of their 
genome by introduction of all or part of RD1-2F9 region and/or RD5 region in said 
genome can be used for the immune system of patients affected with a cancer as for 
example a bladder cancer. 



The present invention relates to a strain of M bovis BCG or M microti, wherein said 
strain has integrated all or part of the. region RD1-2F9 responsible for enhanced 
immunogenicity to the tubercle bacilli, especially the genes encoding tlie ESAT-6 and 
CFP-10 antigenes. These strains will be referred to as the M bovis BCQrrRDl or M. 
10 microti::KDl strains and are useful as a new improved vaccme for prevention of 
tuberculosis infections and for treating superficial bladder cancer. 

hfycobacterium bovis BCG (bacille Calmette-Gu6rin) has been used since 1921 to 
prevent tuberculosis although it is of limited efficacy against adult pulmonary disease in 

15 highly endemic areas. i4ycobacterium microti, another member of the Afycobacterium 
tuberculosis complex, was originally described as the infective agent of a tuberculosis- 
- like disease in voles (Microtus agrestis) \n the 1930's (Wells, A. Q. 1937. Tuberculosis 
in wild voles. Lancet 1221 and Wells, A. Q. 1946. The murine type of tubercle bacillus. 
Medical Research council special report series 259:1-42.). Until recently, M. microti 

20 strains were thought to be pathogenic only for voles, but not for humans and some were 
even used as a live-vaccine. In fact, the vole bacillus proved to be safe and effective in 
preventing clinical tuberculosis in a trial involving roughly 10,000 adolescents in the UK 
in the 1950's (Hart, P. D. a., and I. Sutherland. 1977. BCG and vole bacillus vaccines in 
the prevention of tuberculosis in adolescence and early adult Ufe. British Medical 

25 Journal 2:293-295). At about tiie same time, another strain, OV166, was successfully 
administered to half a million newborns in Prague, former Czechoslovakia, witiiout any 
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serious complications (Sula, L., and L Radkovsky. 1976. Protective effects of M. microti 
vaccine against tuberculosis. J. Hyg. Epid! Microbiol. Immunol. 20:1-6). M microti 
vaccination has since been discontinued because it was n6 more effective than the 
frequently employed BCG vaccine. As a result, improved vaccines are needed for 
5 preventing and treating tuberculosis. 

The problem for attempting to ameliorate this live vaccine is that the molecular 
mechanism of both the attenuation and the immunogenicity of BCG is still poorly 
understood. Comparative genomic studies of all six members of the M tuberculosis 
complex have identified more than 140 genes, whose presence is fecultative, that may 

1 0 confer differences in phenotype, host range and virulence. Relative to the genome of the 
paradigm strain, M tuberculosis H37Rv (S. T. Cole, et al, Nature 393, 537 (1998)), 
many of these genes occur in chromosomal regions that have been deleted from certain 
species (RDl-16, RvDl-5), M. A. Behr, et al.. Science 284, 1520 (1999) ; R. Brosch, et 
al.. Infection hnmun. 66, 2221 (1998) ; S. V. Gordon, et al., Molec Microbiol 32, 643 

15 (1999) ; H. Salamon, et al. Genome Res 10, 2044 (2000), G. G. Mahairas et al, J. 
Bacteriol. 178, 1274 (1996) and R. Brosch, et al., Proc Natl Acad Sci USA 99, 3684 
(2002). 

In coimection with the invention and based on their distribution among tubercle bacilli 
and potential to encode virulence fiiiictions, RDl, RD3-5, RD7 and RD9 (Fig. lA, B) 

20 Were accorded highest priority for fimctioiial genomic analysis using "knock-ins" of M 
bovis BCG to assess their potential contribution to the attenuation process. Clones 
spanning these Rl5 regions were selected from an ordered M tuberculosis H37Rv library 
of integratiiig shuttle cosmids (S. T. Cole, efal, NcOure 393, 537 (1998) and W. R. 
Bange, et al. Tuber. Lung Dis. 79, 171 (1999);, and individually electroporateid mto BCG 

25 Pasteur, where they inserted stably into the attB site (M. H. Lee, et al, Proc. Natl. Acad. 
Set. 88, 3 1 11 (1991);. 



wo 03/085098 



PCT/IB03/01789 



4 

We have uncovered that only reintroduction. of all or part of RD1-2F9 led to profound 
phenotypic alteration. Strikingly, the BCG::RD1 "knock-in" grew more vigorously than 
BCG controls in immuno-deficient mice, inducing extensive splenomegaly and 
granulorna formation. 

5 RDl is restricted to the avirulent strains M bovis BCG and M microti. Although the 
endpoints are not identical, the deletions have removed from both vaccine strains a 
cluster of six genes (Rv3871-Rv3876) lhat are part of the ESAT-6 locus (Fig. lA (S. T. 
Cole, et al. Nature 393, 537 (1998) and F. Tekaia, et al. Tubercle Lung Disease 79, 329 
(1999)). 

10 Among the missing products are members of the mycobacterial PE (Rv3872), PPE 
(Rv3873), and ESAT-6 (Rv3874, Rv3875) protein families. Despite lacking obvious 
. secretion signals, ESAT-6 (Rv3875) and the related protein CFP-10 (Rv3874), are 
abundant components of short-term culture filtrate, acting as immunodominant T-cell 
antigens that induce potent Thl responses (F. Tekaia, et ah. Tubercle Lung Disease 79, 

15 329 (1999) ; A. L. Sorensen, et al. Infect hnmun. 63, 1710 (1995) and R. Colangelli, et 
aL, Infect. Immun. 68, 990 (2000);. 

In smnmary, we have discovered that the restoration of RD1-2F9 to M bovis BCG leads 
to increased persistence in immunocompetent mice. The M bovis BCG::RDl strain 
20 induces RDl-specific immune responses of the Thl-type, has enhanced immunogjenicity 
and confers better protection than M bovis BCG alone in animal models of tuberculosis. 
The M bovis BCG::RD1 vaccine is significantly more virulent than M bovis BCG in 
immunodeficient mice but considerably less virulent than M tuberculosis, 

25 In addition, we show that M, microti lacks a different but overlapping part of the RDl 
region (RDl""*"^ to M bovis BCG and our results indicate that reintroduction of RDl- 
2F9 confers increased virulence of BCG ::RD1 in immunodeficient mice. The rare 
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Strains of M microti that are associated with hurnan disease contain a region referred to 
as RD5""'' whereas those from voles do not 

M hovis BCG vaccine could be improved by reintroducing other genes encoding ESAT- 
6 family members that have been lost, notably, those found in the RD8 and RD5 loci of 
5 M tuberculosis. These regions also code for adcUtional T-cell antigens. 

M. bovis BCG::RD1 could be improved by reintroducing the RD8 and RD5 loci of M 
tuberculosis. 

10 M. bovis BCG vaccine could be improved by reintroducing and overexpressing the genes 
contained in the RD 1 , RD5 and RDS regions. 

Accordingly, these new strains, showing greater persistence and enhanced 
immunogenicity, represent an improved vaccine for preventing tuberculosis and treating 
15 bladder cancer. 



In addition^ the greater persistence of these recombinant strains is an advantage for the 
presentation of other antigens, for instance from HIV in humans and in order to mduce 
protcQtion immune responses. Those improved strains may also be of use in veterinary 
20 medicine, for instance in preventing bovine tuberculosis. 

Description 

Therefore, the present invention is aimed at a strain of M. bovis BCG or M microti, 
wherein said strain has integrated all or part of the RI^1-2F9 region as shown in SEQ ID 
No 1 responsible for enhanced immunogenicity to the tubercle bacilli. These strains will 
25 be referred to as the M bovis BCG::RD1 or M. microti::^l strains. 
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In connection with the invention, "part or all of the RD1-2F9 region" means that tlie 
strain has integrated a portion of DNA originating from Mycobacterium tuberculosis or 
any virulent member of the A^cobacterium tuberculosis complex (M africanum, M. 
bovis, M. canettii), which comprises at least one, two, three, four, five, or more gene(s) 

5 selected from Rv3861 (SEQ ID No 4), Rv3862 (SEQ ID No 5), Rv3863 (SEQ ID No 6), 
Rv3864 (SEQ ID No 7), Rv3865 (SEQ ID No 8), Rv3866 (SEQ ID No 9), Rv3867 (SEQ 
ID No 10), Rv3868 (SEQ ID No 11), Rv3869 (SEQ ID No 12), Rv3870 (SEQ ID No 
13), Rv3871 (SEQ ID No 14), Rv3872 (SEQ ID No 15, mycobacterial PE), Rv3873 
(SEQ ID No 16, PPE), Rv3874 (SEQ ID No 17, CFP-10), Rv3875 (SEQ ID No 18, 

10 ESAT-6), Rv3876 (SEQ ID No 1 9), Rv3877 (SEQ ID No 20), Rv3878 (SEQ ID No 21), 
Rv3879 (SEQ ID No 22), Rv3880 (SEQ ID No 23), Rv3881 (SEQ ID No 24), Rv3882 
(SEQ ID No 25), Rv3883 (SEQ ID No 26), Rv3884 (SEQ ID No 27) and Rv3885 (SEQ 
ED No 28). The expression "a portion of DNA" means also a nucleotide sequence or a 
nucleic acid or a polynucleotide* The expression "gene" is referred herein as the coding 

15 sequence in frame with its natural promoter as well as the coding sequence which has 
^been isolated and framed with an exogenous promoter, for example a promoter capable 
of directing high level of expression of said coding sequence. 

In a ^ecific aspect, the invention relates to a strain of M bovis BCG or M microti 
v\*erein said strain has integrated at least one, two, three or more gene(s) selected from 

20 Rv3867 (SEQ ID No 10), Rv3868 (SEQ ID No 1 1), Rv3869 (SEQ ID No 12), Rv3870 
(SEQ ID No 13)JR.v3871 (SEQ ID No 14), Rv3872 (SEQ ID No 15, mycobacterial PE), 
Rv3873 (SEQ ID No 16, PPE), Rv3874 (SEQ ID No 17, CFP-10), Rv3875 (SEQ ID No 
1 8, ESAT-6), Rv3876 (SEQ ID No 19) and Rv3877 (SEQ ID No 20). 
In an another specific aspect, the invention relates to a strain of M bovis BCG or M 

25 microti wherein said strain has integrated at least one, two, three or more gene(s) 
selected from Rv3871 (SEQ ID No 14), Rv3872 (SEQ ID No 15, mycobacterial PE), 
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RV3873 (SEQ ID No 16, PPE), Rv3874 (SEQ ID No 17, CFP-10), Rv3875 (SEQ ID No 
18, ESAT-6) and Rv3876 (SEQ ID No 19). 

Preferably, a strain according to the invention is one which has integrated a portion of 
DNA originating from Mycobacterium tuberculosis or any virulent member of the 

5 Hd^ycobacterium tuberculosis complex (M africanunt, M bovis, M. canettii), which 
comprises at least four genes selected from Rv3861 (SEQ ID No 4), Rv3862 (SEQ ID 
No 5). Rv3863 (SEQ ID No 6), Rv3864 (SEQ ID No 7), Rv3865 (SEQ ID No 8), 
Rv3866 (SEQ ID No 9), Rv3867 (SEQ ID No 10), Rv3868 (SEQ ID No 11), Rv3869 
(SEQ ID No 12), Rv3870 (SEQ ID No 13), Rv3871 (SEQ ID No 14), Rv3872 (SEQ ID 

10 No 15, mycobacterial PE), Rv3873 (SEQ ID No 16, PPE), Rv3874 (SEQ ID No 17, 
CFP-10), Rv3875 (SEQ ID No 18, ESAT-6), Rv3876 (SEQ ID No 19), Rv3877 (SEQ ID 
No 20), Rv3878 (SEQ ID No 21), Rv3879 (SEQ ID No 22), Rv3880 (SEQ ID No 23), 
Rv3881 (SEQ ID No 24), Rv3882 (SEQ ID No 25), Rv3883 (SEQ ID No 26), Rv3884 
(SEQ ID No 27) and Rv3885 (SEQ ID No 28)., provided that it comprises Rv3874 (SEQ 

15 BO No 17, CFP-10) and/or Rv3875 (SEQ ID No 18, ESAT-6).. 

Strains which have integrated a portion of DNA originating from Mycobacteritm 
tuberculosis or any vfrulent member of the Mycobacterium tuberculosis complex (M 
qfricanum, M bovis, M canettii)compnsmg at least Rv3871 (SEQ ID No 14), Rv3875 
(SEQ ID No 18, ESAT-6) and Rv3876 (SEQ ID No 19) or at least Rv3871 (SEQ ID No 
20 14), Rv3875 (SEQ ID No 18, ESAT-6) and Rv3877 (SEQ ID No 20) or at least Rv3871 
(SEQ ID No 14), RV3875 (SEQ ID No 18, ESAT-6), Rv3876 (SEQ ID No 19) and 
Rv3877 (SEQ ID No 20) aire of particular interest 

The above strains according to the invention may fiirther comprise Rv3874 (SEQ ID No 
17, CFP-10), Rv3872 (SEQ ID No 15, mycobacterial PE) and/or Rv3873 (SEQ ID No 
25 16, PPE). In addition, it may further comprise at least one, two, three or four gene(s) 
selected from Rv3861 (SEQ ID No 4), Rv3862 (SEQ ID No 5), Rv3863 (SEQ ID No 6), 
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Rv3864 (SEQ ID No 7), Rv3865 (SEQ ID No 8), Rv3S66 (SEQ ID No 9), Rv3867 (SEQ 
ID No 10), Rv3868 (SEQ ID No 11), Rv3869 (SEQ ID No 12), Rv3870 (SEQ ID No 
13), Rv3878 (SEQ ID No 21), Rv3879 (SEQ ID No 22), RvSSSO (SEQ ID No 23), 
Rv3881 (SEQ ID No 24), Rv3882 (SEQ ID No 25), Rv3883 (SEQ ID No 26), Rv3884 
5 (SEQ ID No 27) and Rv3885 (SEQ ID No 28). 

The invention encompasses strains which have integrated a portion of DNA originating 
from A<fycobacterium tuberculosis or any virulent member of the Mycobacterium 
tuberculosis complex (M qfricanumy M bovis, M canettii)^ which comprises Rv3875 
. (SEQ ID No 18, ESAT-6) or Rv3874 (SEQ ID No 17, CFP-10) or both Rv3875 (SEQ ID 
10 No 18,ESAT-6)andRv3874(SEQIDNol7,CFP.10), 

These genes can be mutated (deletion, insertion or base modification) so as to maintain 
the improved immunogenicity wMle decreasing the virulence of the strains. Using 
routine procedure, the man skilled in the art can select the M bovis BCG::RD1 or M 
microtiiiKDl strains, in which a mutated gene has been integrated, showing improved 
15 immunogenicity and lower virulence. 

We have shown here that introduction of the RD1-2F9 region malces the vaccine strains 
induce a more effective immune response agauist a challenge with M. tuberculosis. 
However, this first generation of constructs can be followed by other, more fine-tuned 
generations of constructs as the complemented BCG::RD1 vaccine strain also showed a 
20 more virulent phenotype in severely immuno^mprdmised (SCID) mice. Therefore, the 
BCG::RD1 constructs may be modified so as to be applicable as vaccine strains while 
being safe for immuno-compromised indiyidualsThe term "construct" means an 
engineered gene unit, usually involving a gene of interest that has been fused to a 
promoter . 

25 
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In this perspective, the man skilled in the art can adapt the BCG::RD1 strain by the 
conception of BCG vaccine strains that only cany parts of the genes coding for ESAT-6 
or CFP-10 in a mycobacterial expression vector (for example pSM81) under the control 
of a promoter, more particularly an hsp60 promoter. For example, at least one portion of 

5 the esat-6 gene that codes for immunogenic 20-mer peptides of ESAT-6 active as T-cell 
epitopes (Mustafa AS, Oftung F, Amoudy HA, Madi NM, Abal At, Shaban F, Rosen 
Krands I, & Andersen P. (2000) Multiple epitopes from the Mycobacterium tuberculosis 
ESAT-6 antigen are recognized by antigen-specific human T cell lines. Clin Infect Dis. 
30 Suppl 3:8201-5, peptides PI to P8 are incorporated herein in the description) could 

10 be cloned into this vector and electroporated mto BCG, resulting in a BCG strain that 
produces these epitopes. 

Alternatively, the ESAT-6 and CFP-10 encoding genes (for example on plasmid RDl- 
AP34 and or RD1-2F9) could be altered by directed mutagenesis (using for example 
15 QuikChange Site-Directed Mutagenesis Kit from Stratagen) in a way that most of the 
immunogenic peptides of ESAt-6 remain intact, but the biological fiiiictionality of 
ESAT-6 is lost. 

This approach could result in a more protective BCG vaccine without increasing the 
virulence of the recombinant BCG strain. 

20 

Therefore, the invention is also aimed at a method for preparing and selecting M bovis 
BCG or M, microti recombinant strains comprising a step consisting of modifying the M 
bovis BCG::RD1 or M m/cro//::RDl strains as defined above by insertion, deletion or 
mutation in the integrated RDl region, more particularly in the esat-6 or CFP-10 gene, 
25 said method leading to strains that are less virulent for immuno-depressed individuals. 
Together, these methods would allow to explain what causes the effect that we see with 
our BCG::RD1 strain (the presence of additional T-cell epitopes from ESAT-6 and 
CFPIO resulting in increased immunogenicity) or whether the effect is caused by better 
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fitness of the recombinant BCG::RD1 clones resulting in longer exposure time of the 
immune system to the vaccine - or - by a combinatorial efifect of both factors. 

In a preferred embodiment, the invention is aimed at the M bovis BCG::RD1 strains, 
5 which have integrated a cosmid herein referred to as the RD1-2F9 and RD1-AP34 
contained in the E. coli strains deposited on April 2, 2002 at the CNCM (Institut Pasteur, 
25, rue du Docteur Roux, 75724 Paris cedex 15, France) under the accession number I- 
2831 and 1-2832 respectively. The RD1-2F9 is a cosmid comprising the portion of the 
Mycobacterium tuberculosis H37Rv genome previously named RD1-2F9 that spans the 
10 RDl region and contains a gene conferring resistance to Kanamycin. The RD1-AP34 is a 
cosmid comprising a portion of the . Aify'cobacteriunj tuberculosis H37Rv genome 
containing two genes coding for ESAT-6 and CFP-10 as well as a gene conferring 
resistance to Kanamycin. 

15 The cosmid RD1-AP34 contains a 3909 bp fragment of the M tuberculosis H37Rv 
genome from region 4350459 bp to 4354367 bp that has been cloned into an integrating 
vector pKint in order to be integrated in the genome of A^cobacterium bovis BCG and 
Afycobacterium microti strains (SEQ ID No 3). The Accession No. of the segment 160 of 
the M tuberculosis H37Rv genome that contains this region is AL022120. . 

20 

SEQ ID No 3: 

1 - gaattcccat ccagtgagtt caaggtcaag cggcgccccc ctggccaggc atttctcgtc 
25 . 61 - tcgccagacg gcaaagaggt catccaggcc ccctacatcg agcctccaga agaagtgttc 
121 - gcagcacccc caagcgccgg ttaagattat ttcattgccg gtgtagcagg acccgagctc 
1 8 1 - agcccggtaa tcgagttcgg gcaatgctga ccatcgggtt tgtttccggc tataaccgaa 
241 - cggtttgtgt acgggataca aatacaggga gggaagaagt aggcaaa/gg aaaaaatgtc 
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301 ' acatgatccg atcgctgccg acattggcac gcaagtgagc gacaacgctc tgcacggcgt 
36 J ' gacggccggc tcgacggcgc tgacgtcggt gaccgggctg gttcccgcgg gggccgatga 
421 ' ggtctccgcc caagcggcga cggcgttcac atcggagggc atccaattgc iggcttccaa 
481 - tgcatcggcc caagaccagc tccaccgtgc gggcgaagcg gtccaggacg tcgcccgcac 

5 541 - ctattcgcaa atcgacgacg gcgccgccgg cgtcttcgcc gaataggccc ccaacacatc 
601 - ggagggagtg atcac catgc tgtggcacgc aatgccaccg gagctaaata ccgcacggct 
661 - gatggccggc gcgggtccgg ctccaatgct tgcggcggcc gcgggatggc agacgctttc 
721 - ggcggctctg gacgctcagg ccgtcgagtt gaccgcgcgc ctgaactctc tgggagaagc 
781 - ctggactgga ggtggcagcg acaaggcgct tgcggctgca acgccgatgg tggtctggct 

10 841 - acaaaccgcg tcaacacagg ccaagacccg tgcgatgcag gcgacggcgc aagccgcggc 
901 - atacacccag gccatggcca cgacgccgtc gctgccggag atcgccgcca accacatcac 
961 - ccaggccgtc cttacggcca ccaacttctt cggtatcaac acgatcccga tcgcgttgac 
1021 - cgagatggat tatttcatcc gtatgtggaa ccaggcagcc ctggcaatgg aggtctacca 
1081 - ggccgagacc gcggttaaca cgcttttcga gaagctcgag ccgatggcgt cgatccttga 

15 1 141 - tcccggcgcg agccagagca cgacgaaccc gatcttcgga atgccctccc ctggcagctc 
1201 - aacaccggtt ggccagttgc cgccggcggc tacccagacc ctcggccaac tgggtgagat 
1261 ^ gagcggcccg atgcagcagc tgacccagcc gctgcagcag gtgacgtcgt tgttcagcca 
1321 - pgtgggcggc accggcggcg gcaacccagc cgacgaggaa gccgcgcaga tgggcctgct 
1381 - cggcaccagt ccgctgtcga accatccgct ggctggtgga tcaggcccca gcgcgggcgc 

20 1441 - gggcctgctg cgcgcggagt cgctacctgg cgcaggtggg tcgttgaccc gcacgccgct 
1501 - gatgtctcag ctgatcgaaa agccggttgc cccctcggtg atgccggcgg ctgctgccgg 
1561 - atcgtcggcg acgggtggcg ccgctccggt gggtgcggga gcgatgggcc agggtgcgca 
1621 - atccggcggc tccaccaggc cgggtctggt cgcgccggca ccgctcgcgc aggagcgtga 
1681 " agaagacgac gaggacgact gggacgaaga ggacgactgg t gagctcccg taatgacaac 

25 1741 - agacttcccg gccacccggg ccggaagact tgccaacatt ttggcgagga aggtaaagag 

1801 - agaaagtagt ccagcatggc agagatgaag accgatgccg ctaccctcgc gcaggaggca 
1861 - ggtaatttcg agcggatctc cggcgacctg aaaacccaga tcgaccaggt ggagtcgacg 
1921 - gcaggttcgt fgcagggcca gtggcgcggc gcggcgggga cggccgccca ggccgcggtg 



wo 03/085098 



PCT/IB03/01789 



12 . 

1981 - gtgcgcttcc aagaagcagc caataagcag aagcaggaac tcgacgagat ctcgacgaat 
2041 - attcgtcagg ccggcgtcca atactcgagg gccgacgagg agcagcagca ggcgctgtcc 
2101 - tcgcaaatgg gcttctgacc cgctaatacg aaaagaaacg gagcaaaaac atsacasa^c 
2161 - a^cagrf^saa tttc^csrs^si atcsas^cc^ c^^caasc ^c aatcca^si^a aamcac^t 
5 2221 - ccattcattc cctccttsac e^as^^ssaa^c aztcccmac caasctc^ ca srcs£cciss2 
2281 - £C£srtasc^2^ ttc^sas^^cs taccassst^ tccascdaaa atss^a c^cc acssctacc^ 
2341 - affctQaacaa cffc£ct£cag^ aaccts2C^c ssacsatca^ csaaB CC^st ca££caat^s 
2401 - cttcsaccsa a^^caac^tc act^^^am fcstccA sm^u. caacgccgag ttcgcgtaga 
2461 - atagcgaaac acgggatcgg gcgagttcga ccttccgtcg gtctcgccct ttctcgtgtt 
10 252 1 - tatacgtttg agcgcactct gagaggttgt catggcggcc gactacgaca agctcttccg 
2581 - gccgcacgaa ggtatggaag ctccggacga tatggcagcg cagccgttct tcgaccccag 
2641 - tgcttcgttt ccgccggcgc ccgcatcggc aaacctaccg aagcccaacg gccagactcc 
2701 - gcccccgacg tccgacgacc tgtcggagcg gttcgtgtcg gccccgccgc cgccaccccc 
2761 - acccccacct ccgcctccgc caactccgat gccgatcgcc gcaggagagc cgccctcgcc 
15 2821 - ggaaccggcc gcatctaaac cacccacacc ccccatgccc atcgccggac ccgaaccggc 
2881 - cccacccaaa ccacccacac cccccatgec ciatcgccgga cccgaaccgg ccccacccaa 
2941 - accacccaca cctccgatgc ccatcgccgg acctgcaccc accccaaccg aatcccagtt 
3001 - ggcgcccccc agacpaccga caccacaaac gccaaccgga gcgccgcagc aaccggaatc 
3061 - accggcgccc cacgtaccct cgcacgggcc acatcaaccc cggcgcaccg caccagcacc 
20 3 12 1 - gccctgggca aagatgccaa tcggcgaacc cccgcccgct ccgtccagac cgtctgcgtc 
3181 - cccggccgaa ccaccgaccc ggcctgcccc ccaacactcc cgacgtgcgc gccggggtca 
3241 - ccgctatcgc acagacaccg aacgaaacgt cgggaaggta gcaactggtc catccatcca 
3301 - ggcgcggctg cgggcagagg aagcatccgg cgcgcagctc gcccccggaa cggagccctc 
3361 - gccagcgccg ttgggccaac cgagatcgta tctggctccg cccacccgcc ccgcgccgac 
25 342 1 - agaacctccc cccagcccct cgccgcagcg caactccggt cggcgtgccg agcgacgcgt 
3481 - ccaccccgat ttagccgccc aacatgccgc ggcgcaacct gattcaatta cggccgcaaic 
3541 - cactggcggt cgtcgccgca agcgtgcagc gccggatctc gacgcgaciac agaaatcctf 
3601 - aaggccggcg gccaaggggc cgaaggtgaa gaaggtgaag ccccagaaac cgaaggccac 
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3661 - gaagccgccc aaagtggtgt cgcagcgcgg ctggcgacat tgggtgcatg cgttgacgcg 
3721 - aatcaacctg ggcctgtcac ccgacgagaa gtacgagctg gacctgcacg ctcgagtccg 
3781 - ccgcaatccc cgcgggtcgt atcagatcgc cgtcgtcggt ctcaaaggtg gggctggcaa 
3841 - aaccacgctg acagcagcgt tggggtcgac gttggctcag gtgcgggccg accggatcct 
5 3901 - ggctctaga 

pos. 0001-0006 EcoRI-restriction site 

pos. 0286-0583 Rv3872 coding for a FE'Protem(SEQID^o 15) 
pos. 0616-1720 Rv3873 coding for a PPE-Protein ( SEP ID No 16) 
10 pos. 1816-2115 Rv3874 coding for Culture Filtrat protein lOkD (CFPIO) (SEQ ID 
No 17) 

pos. 2151-2435 Rv3875 coding for Early Secreted Antigen Target 6kD (ESAT6) ( SEP 
ID No 18) 

pos. 3903-3609 Xbal-restriction site 
15 pos. 1 8 1 6-2435 CFP-IO gene + esat-6 gene (SEQ ID No 29). 

These sequences can be completed with the Rv3861 to Rv3871, and Rv3876 to Rv3885 
as referred in Table 1 below. 



Gene 
Name 


Gene 
length 


Protein 
length 


Gene 
type 


Accession 
number in 
NCBI 
Bank 

NC = gene 
NP 

protein 


Loc (kb) in 
M. 

tuberculosis 
H37RV 


Coordinates in 
Mycobacterium 
tuberculosis 
H37RV 


Molecular 
mass of 
protein 
(Dalton) 


Description 


Rv3861 


324 


108 


CDS 




4337.95 


4337946 .. 
4338269 


11643.42 


hypothetical 
protein 


Rv3862 
c- 

\vhiB6 


348 


116 


CDS 




4338,52 


compl 

4338174.. 

4338521 


12792.38 


possible 
transcriptional 
regulatoiy 
protein whiB- 
like WhiB6 


Rv3863 


1176 


392 


CDS 




4338.85 


4338849.. 
4340024 


41087.44 


hypothetical 
alanine rich 
protein 


Rv3864 


1206 


402 


CDS 




4340.27 


4340270.. 


42068.66 


conserved 
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4341475 




hypothetical 
protein 


Rv3865 


309 


103 


CDS 




4341.57 


4341566.. 
4341874 


10618.01 


conserved 

hypothetical 

protein 


Rv3866 


849 


283 


CDS 




4341.88 


4341880.. 
4342728 . 


30064.04 


conserved 

hypothetical 

protein 


Rv3867 


549 


183 


CDS 


NC 000962 
NP 218384 


4342.77 


4342767 
4343318 


19945.52 


conserved 
protein 


Rv3868 


1719 


573 


CDS 


NC 000962 
NP_218385 


43433 


4343311 
4345032 


62425.40 


conserved 
protein 


Rv3869 


1440 


480 


CDS 


NC 000962 
NP_218386 


4345.04 


4345036 
4346478 


51092 58 


possible 
conserved 

membrane 
protein 


Rv3870 


2241 


747 


CDS 


NC 000962 
NP__218387 


4346.48 


4346478 
4348721 


80912.76 


possible 
conserved 
membrane 
protein 


Rv3871 


1773 


591 


CDS 


NC 000962 
NPJ218388 


4348.83 


4348824 
4350599 


64560.65 


conserved 
protein 


Rv3876 


1998 


666 


CDS 


NC 000962 
NP_218393 


4353.01 


4353007 
4355007 


70644.92 


conserved 
proline and 
alanine rich 
protein 


Rv3877 


1533 


511 


CDS 


NC_000962 
NP_218394 


4355.01 


4355004 
4356539 . 


53981.12 


probable 
conserved 
transmembrane 
protein 


Rv387S 


840 


280 


CDS 


NC_0a0962 


4356.69 


4356693.. 
4357532 


27395.23 


conserved 
hypothetical 
alanine rich 
protein 


Rv3879 
c 


2187 


729 


CDS 


NC_000962 


4359.78 


compl. 

4357596.. 

4359782 


74492.13 


hypothetical 
alanine and 
proline rich 
protein 


Rv3880 
c 


345 


115 


CDS 


NC_000962 


4360.55 


compl. 

4360202.. 

4360546 


12167.51 


conserved 

hypothetical 

protein 


Rv3881 
c 


1380 


460 


CDS 


NCJ)00962 


4361.92 


compL 

4360546.. 

4361925 


47593.62 


conserved 
hypothetical 
alanine . and 
glycine rich 
protein 


Rv3882 
c 


1386 


462 


CDS 


NC_000962 


4363.42 


compl. 
4362035.. . 


50396.58 


possible conser 
ved membrane 
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4363420 




protein 


Rv3883 
c 


1338 


446 


CDS 


NC_000962 


4364.76 


compl. 

4363420.. 

4364757 


45085.89 


possible 
secreted 
protease 


Rv3884 
c- 


1857 


619 


CDS 


NC_000962 


4366.84 


compl. 

4364982.. 

4366S38 


68040.97 


probable 
CBXX/CFQX 
family protein 


Rv3885 
c 


1611 


537 


CDS 


NC_000962 


4368.52 


compl. 

4366911.. 

4368521 


57637.95 


possible 
conserved 

membrane 
protein 



The sequence of the fragment RD1-2F9 (-32 kb) covers the region of the M 
tuberculosis genome AL 123456 from ca 4337 kb to ca. 4369 kb, and also contains the 
5 sequence described in SEQ E) No 1. Therefore, the invention also embraces M bovis 
BCG::RD1 strain and M microtiiiRDl strain which have integrated the sequence as 
shown in SEQ H) No 1. 



Tlie above described strains fulfill the aim of the invention which is to provide an 
10 improved tuberculosis vaccine or M bovis BCG-based prophylactic or therapeutic agent, 
or a recombinant M microti derivative for these purposes. 

The above described M bovis BCG::RP1 strains are better tuberculosis vacciiies than M 
bovis BCG, These strains can also be improved by reintroducing other genes foimd in the 
15 RD8 and RD5 loci of M tuberculosis or any virulent member of the Mycobacterium 
tuberculosis complex (A£ qfiicanum, M. bovis, M canettii). These regions code for 
additional T-cell antigens. 

As indicated, overexpressing the genes contained in the RDl, RD5 and RD8 regions by 
means of exogenous promoters is encompassed by the invention. The same applies 
20 regarding M microtiv:KD\ strains. M microti strains could also be improved by 
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reintroducing the RD8 locus of M tuberculosis or any virulent member of the 
l^cobacterium tuberculosis complex (M ajHcanum^ M. bovis^ M. ccmettii). 

5 

In. a second embodiment, the invention is directed to a cosmid or a plasmid, more 
conmidnly named vectors, comprising all or part of the RD1-2F9 region originating 
from Mycobacterium tuberculosis or any virulent member of the Mycobacterium 
tuberculosis complex (M qfincanum^ M. boviSy M. canettii)^ said region comprising 

10 at least one, two, three or more gene(s) selected from Rv3861 (SEQ ID No 4), 

Rv3862 (SEQ ID No 5), Rv3863 (SEQ ID No 6), Rv3864 (SEQ ID No 7), Rv3865 
(SEQ ID No 8), Rv3866 (SEQ ID No 9), Rv3867 (SEQ ID No 10), Rv3868 (SEQ ID 
No 11), Rv3869 (SEQ ID No 12), Rv3870 (SEQ ID No 13), Rv3871 (SEQ ID No 
14), Rv3872 (SEQ ID No 15, mycobacterial PE), Rv3873 (SEQ ID No 16, PPE), 

15 Rv3874 (SEQ ID No 17, CFP-10), Rv3875 (SEQ ID No 18, ESAT.6), Rv3876 (SEQ 
ID No 19), Rv3877 (SEQ ID No 20), Rv3878 (SEQ ID No 21), Rv3879 (SEQ ID No 
22), RV3880 (SEQ ID No 23), Rv3881 (SEQ ID No 24), Rv3882 (SEQ ID No 25), 
Rv3883 (SEQ ID No 26), Rv3884 (SEQ ID No 27) and Rv3885 (SEQ ID No 28). 
The term "vector'' refers to a DNA molecule originatmg from a vims, a bacteria, or 

20 th(B cell of a higher organism into which another DNA fragment of appropriate size 
can be integrated without loss of the Vectors capacity for self-replication; a vector 
introduces foreign DNA into host cells, where it can be reproduced in large 
quantities. Examples are plasmids, cosmids, and yeast artificial chromosomes: 
vectors are often recombinant molecules containing DNA sequences from several 

25 sources. 



Preferably, a cosmid or a plasmid of the invention comprises a part of the RD1-2F9 
region originating from Mycobacteiium tuberculosis or any virulent member of the 
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Mycobacterium tuberculosis complex (M a/Hcanum, M. bovis, M. canettii), said part 
comprising at least one, two, three or more gene(s) selected from Rv3867 (SEQ ID No 
10), Rv3868 (SEQ ID No 11), Rv3869 (SEQ ID No 12), Rv3870 (SEQ ID No 13), 
Rv3871 (SEQ ID No 14), Rv3872 (SEQ ID No 15, mycobacterial PE), Rv3873 (SEQ ID 
5 No 16, PPE), Rv3874 (SEQ ID No 17, CFP-10), Rv3875 (SEQ ID No 18, ESAT-6), 
Rv3876 (SEQ ID No 19) and Rv3877 (SEQ ID No 20). 

Preferably, a cosmid of a plasmid of the invention comprises a part of the RD1-2F9 
region originating from Mycobacterium tuberculosis or any virulent member of the 
10 Afycobacterium tuberculosis complex (M qfricanum, M. bovis, M canettii), said part 
comprising at least one, two, tliree or more gene(s) selected from Rv3872 (SEQ ID No 
15, mycobacterial PE), Rv3873 (SEQ ID No 16, PPE), Rv3874 (SEQ ID No 17, CFP- 
10), Rv3875 (SEQ ID No 1 8, ESAT-6) and Rv3876 (SEQ ID No 1 9). 

15 Preferably, a cosmid or a plasmid of the invention comprises CFP-10, ESAT-6 or both 
or a part of them. It may also comprise a mutated gene selected CFP-10, ESAT-6 or 
both, said mutated gene being responsible for the improvied immunogenicity and 
decreased virulence. 

20 A cosmid or a plasmid as mentioned above may comprise at least four genes selected 
from Rv3861 (SEQ ID No 4), Rv3862 (SEQ ID No 5), Rv3863 (SEQ ID No 6), Rv3864 
(SEQ ID No 7), Rv3865 (SEQ ID No 8), Rv3866 (SEQ ID No 9), Rv3867 (SEQ ID No 
10), Rv3868 (SEQ ID No 11), Rv3869 (SEQ ID No 12), Rv3870 (SEQ ID No 13), 
Rv3871 (SEQ ID No 14), Rv3872 (SEQ ID No 15, mycobacterial PE), Rv3873 (SEQ ID 

25 No 16, PPE), Rv3874 (SEQ ID No 17, CFP-10), Rv3875 (SEQ ID No 18, ESAT-6), 
Rv3876 (SEQ ID No 19), Rv3877 (SEQ ID No 20), Rv3878 (SEQ ID No 21), Rv3879 
(SEQ ID No 22), Rv3880 (SEQ ID No 23), Rv3881 (SEQ ID No 24), Rv3882 (SEQ ID 
No 25), Rv3883 (SEQ ID No 26), Rv3884 (SEQ ID No 27) and Rv3885 (SEQ ID No 
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28), provided that it comprises Rv3874 (SEQ ID No 17, CFP-10) and/or Rv3875 (SEQ 
E)No 18,ESAT-6) 

Advantageously, a cosmid or a plasmid of the invention comprises a portion of DNA 
5 originating from Mycobacterium tuberculosis or any virulent member of the 
Mycobacteriian tuberculosis complex (M qfricanum, M bovis, M canettii), which 
comprises at least Rv3871 (SEQ ID No 14), Rv3875 (SEQ ID No 18, ESAT-6) and 
Rv3876 (SEQ ID No 19) or at least Rv3871 (SEQ ID No 14), Rv3875 (SEQ ID No 18, 
ESAT-6) and Rv3877 (SEQ ID No 20) or at least Rv3871 (SEQ ID No 14), Rv3875 
10 (SEQ ID No 1 8, ESAT-6), Rv3876 (SEQ ID No 19) and Rv3877 (SEQ ID No 20). 

The above cosmids or plasmids may further comprise Rv3872 (SEQ ID No 15, 
mycobacterial PE) Rv3873 (SEQ ID No 16, PPE) Rv3874 (SEQ ID No 17, CFP-10). It 
may also further comprise at least one, two, three or four gene(s) selected from Rv3861 

15 (SEQ ID No 4), Rv3862 (SEQ ID No 5), Rv3863 (SEQ ID No 6), Rv3864 (SEQ ID No 
7), Rv3865 (SEQ ID No 8), Rv3866 (SEQ ID No 9), Rv3867 (SEQ ID No 10), Rv3868 
(SEQ ID No 11), Rv3869 (SEQ ID No 12), Rv3870 (SEQ ED No 13), Rv3878 (SEQ ID 
No 21), Rv3879 (SEQ ID No 22), Rv3880 (SEQ ID No 23), Rv3881 (SEQ ID No 24), 
Rv3882 (SEQ ID No 25), Rv3883 (SEQ ID No 26), Rv3884 (SEQ ID No 27) and 

20 Rv3885(SEQIDNo28). 

Two particular cosmids of the invention are the cosmids herein referred as RD1-2F9 and 
RD1-AP34 contained in the E. coli strains deposited at the CNCM (Institut Pasteur, 25, 
liie du Docteur Roux, 75724 Paris cedex 15, France) under the accession number 1-2831 
25 and 1-2832 respectively. 

A particular plasmid or cosmid of the invention is one which has integrated the complete 
RD 1-2F9 region as shown in SEQ ID No 1 . 
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The invention also relates to the use of these cosmids or plasmids for transforming K 
bovis BCG or M microti strains. 

5 As indicated above, these cosmids or plasmids may comprise a mutated gene selected 
from Rv3861 to Rv3885, said mutated gene being responsible for the improved 
immunogenicily and decreased virulence. 

In another embodiment, the invention embraces a pharmaceutical composition 
10 comprising a strain as depicted above and a pharmaceutically acceptable carrier. 

In addition to the strains, these pharmaceutical compositions may contain suitable 
pharmaceutically-acceptable carriers comprising excipients and auxiliaries which 
facilitate processing of the living vaccine into preparations which can be used 
15 pharmaceutically. Further details on techniques for formulation and administration may 
be found in the latest edition of Remington's Pharmaceutical Sciences (Maack 
Publishing Co., Easton, Pa.). 

Preferably, such composition is suitable for oral, intravenous or subcutaneous 
administration. 

20 The determination of the effective dose is well within the capability of those skilled in 
the art. A therapeutically effective dose refers to that amount of active ingredient, i.e the 
number of strains administered, which ameliorates the symptoms or condition. 
Therapeutic efficacy and toxicity may be determined by standard pharmaceutical 
procedures in experimental animals, e.g., ED50 (the dose therapeutically effective in 

25 50% of the population) and LD50 (the dose lethal to 50% of the population). The dose 
ratio of toxic to therapeutic effects is the therapeutic index, and it can be expressed as the 
ratio, LD50/ED5d. Pharmaceutical compositions which exhibit large therapeutic indices 
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are preferred. Of course, ED50 is to be modulated according to the mammal to be treated 
or vaccinated. In this regard, the invention contemplates a composition suitable for 
human administration as well as veterinary composition. 

The invention is also aimed at a vaccine comprising a M. bovis BCG::RD1 or M 
5 microti::RDl strain as depicted above and a suitable carrier. This vaccine is especially 
useful for preventing tuberculosis. It can also be used for treating bladder cancer. 

The M bovis BCG::RD1 or M jnicrotiziRDl strains are also useful as a carrier for the 
expression and presentation of foreign antigens or molecules of interest that are of 
therapeutic or prophylactic interest. Owing to its greater persistence, BCG::RD1 will 

10 present antigens to the infmiune system over a longer period thereby inducing stronger, 
more robust immune responses and notably protective responses. Examples of such 
foreign antigens can be found in patents and patent applications US 6,191,270 for 
antigen LSA3, US 6,096,879 and US 5,314,808 forHBV antigens, EP 201,540 for HXV- 
1 antigens, US 5,986,051 for H. pylori antigens and FR 2,744,724 for P. falciparum 

15 MSP-1 antigen: . 

The invention also concerns a product comprising a strain as depicted above and at least 
one protein selected from ESAT-6 and CFP-10 or epitope derived thereof for a separate, 
simultaneous or sequential use for treating tuberculosis. 

In still another embodiment, the invention concerns the use of a M, bovis BCGiiRDl or 
20 M microtiyJRDl strain as depicted above for preventing or treating tuberculosis. 

It also concerns the use of a M bovis BCG::RD1 or M microti::RDl stram as a powerful 
adjuvant/immunomodulator used in the treatment of superficial bladder cancer. 

The invention also contemplates the identification at the species level of members of the 
25 M tubercidosis complex by means of an RD-based molecular diagnostic test. Inclusion 
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of markers for RDl"'^^ and RDS"*'^ would improve the tests and act as predictors of 
virulence, especially in humans. 

In this regard, the invention concerns a diagnostic kit for the identification at the species 
level of members of the M tuberculosis complex comprising DNA probes and primers 
5 specifically hybridizing to a DNA portion of the RDl or RD5 region of M tuberculosis^ 
more particularly probes hybridizing under stringent conditions to a gene selected from 
Rv3871 (SEQ ID No 14), Rv3872 (SEQ ID No 15, mycobacterial PE), Rv3873 (SEQ ID 
No 16, PPE), Rv3874 (SEQ ID No 17, CFP-10), Rv3875 (SEQ ID No 18, ESAT-6), and 
Rv3876 (SEQ ID No 19), preferably CFP-10 and ESAT-6. 

10 As used herein, the term "stringent conditions" refers to conditions which permit 
hybridization between the probe sequences and the polynucleotide sequence to be 
detected. Suitably stringent conditions can be defined by, for example, the 
concentrations of salt or formamide in the prehybridization and hybridization solutions, 
or by the hybridization temperature, and are well known in the art. In particular, 

15 stringency can be increased by reducing the concentration of salt, increasing the 
concentration of formamide, or raising the hybridization temperature. The temperature 
range corresponding to a particular level of stringency can be further narrowed by 
calculating the purine to pyrimidine ratio of the nucleic acid of interest and adjusting the 
temperature accordingly. Variations on the above ranges and conditions are well known 

20 in the art. 

Among the preferred primers, we can cite: 

primer esat-6F GTCACGTCCATTCATTCCCT (SEQ ID No 32), 

primer esat-6R ATCCCAGTGACGTTGCCTT) (SEQ ID No 33), 
. 25 primer RDr"*" flanking region F GCAGTGCAAAGGTGCAGATA (SEQ ID No 34), 
primer RD 1'"'*' flanking region R GATTGAGACACTTGCCACGA (SEQ ID No 35), 
primer RD5"^^ flanking region F GAATGCCGACGTCATATCG (SEQ ID No 39),' 
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primer RDS*"^^ flanking region R CGGCCACTGAGTTCGATTAT (SEQ ID No 40). 

The present invention covers also the complementary nucleotide sequences of said above 
primers as well as the nucleotide sequences hybridizing under stringent conditions with 
5 them and having at least 20 nucleotides and less than 500 nucleotides. 

Diagnostic kits for the identification at the species level of members of the M 
tuberculosis complex comprising at least one, two, three or more antibodies directed to 
mycobacterial PE, PPE, CFP-10, ESAT-6, are also embraced by the invention. 

Preferably, such kit comprises antibodies directed to CFP-10 and ESAT-6. 

As used herein, the term "antibody" refers to intact molecules as well as fragments 
thereof, such as Fab, F(ab').sub.2, and Fv, which are capable of binding the epitopic 
15 determinant. Probes or antibodies can be labeled with isotopes, fluorescent or 
phosphorescent molecules or by any other means known in the art. 

The invention also relates to virulence markers associated with RDl and/or RD5 regions 
of the genome of M tuberculosis or a part of these regions. 

20 

The invention is further detailed below and will be illustrated with the following figures. 
Figure legends 

Figure 1: M. bovis BCG and M. microti have a chromosomal deletion, RDl, 
spanning the c/^i<?-e5fl/tf locus. 
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. . (A) Map of the cjplO-esatS region showing the six possible reading. frames and the M 
tuberculosis H37Rv gene predictions. This map is also available at: 
(http://genolist.pasteur.fr/TubercuList/) , 

The deleted regions are shown for BCG and M microti with their respective H37Rv 
5 genome coordinates, and the extent of the conserved ESAT-6 locus (F. Tekaia, et aly 
Tubercle Lung Disease 79, 329 (1999)^, is indicated by the gray bar. 

(B) Table showing characteristics of deleted regions selected for complementation 
analysis. Potential virulence factors and their putative ftirictions disrupted by each 
deletion are shown. The coordinates are for the M, tuberculosis H37Rv genome. 

10 (C) Clones used to complement BCG. Individual clones spanning RDl regions (RDl- 
1106 and RD1-2F9) were selected fronl an ordered M, tuberculosis genomic library (R.B. 
unpublished) in pYUB412 (S. T, Cole, et al. Nature 393, 537 (1998) and W. R. Bange, 
F. M. Collins, W. R. Jacobs, Jr., Tuber. Lung Dis. 79, 171 (1999)) and electroporated 
into M bovis BCG strains, or M. microti. Hygromycin-resistant transformants were 

15 verified using PCR specific for the corresponding genes. pAP35 was derived from RDl- 
2F9 by excision of an AfM fragment. pAP34 was constructed by subcloning an EcoKl' 
Xbal fragment into the integrative vector pKINT. The ends of each firagment are related 
to the BCG RDl deletion (shaded box) with black lines and the H37Rv coordinates for 
the other fragment ends given in kilobases. 

20 (D) Immunoblot analysis, using an ESAT-6 monoclonal antibody, of whole cell protein 
extracts from log-phase cultures of (well n^'l) H37Rv (S. T. Cole, et al. Nature 393, 537 
(1998)), (n^2) BCG::pYUB412 (M. A. Behr, et al. Science 284, 1520 (1999)), (n°3) 
BCG::RD1-I106 (R. Brosch, et al. Infection Immun. 66, 2221 (1998)), (n°4) 
BCG::RD1-2F9 (S. V. Gordon, et al, Molec Microbiol 32, 643 (1999)), (n°5) M bovis 

25 (H. Salamon et al. Genome Res 10, 2044 (2000)), (n**6) Mycobacterium smegmatis (G. 
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G. Mahairas, et al, J. Bactenol 178, 1274 (1996)), (n°7) M smegmatis::pYUB412, and 
(n^8) M smeginatis" RD1-2F9 (R. Brosch, et al, Proc Natl Acad Sci USA 99, 3684. 
(2002)). 

Figure 2: Complementation of BCG Pasteur with the RDl region alters the colony 
5 morphology and leads to accumulation of Rv3873 and ESAT-6 in the cell wall. 

(A) Serial dilutions of 3 week old cultures of BCG::pYUB412, BCG::I106 or 
BCG::RDi-2F9 growing on Middlebrook 7H10 agar plates. The white square shows the 
area of the plate magnified in the image to the right. 

(B) Light microscope image at fifty fold magnification of BCG::pYUB412 and 
10 BCG::RD1-2F9 colonies. 5 \i\ drops of bacterial suspensions of each strain were spotted 

adjacently onto 7H10 plates and imaged after 10 days growth, illuminating the colonies 
through the agar. 

(C) Immunoblot analysis of different cell fractions of H37Rv obtained from 
http://www.cvmbs.colostate.edu/microbiologv/tb/ResearchMA.html using either an anti- 

15 ESAT-6 antibody or 

(D) anti-Rv3873 (PiPE) rabbit serum. H37Rv and BCG signify whole cell extracts from 
the respective bacteria and Cyt, Mem and CW correspond to the cytosolic, membrane 
and cell wall fractions of M tuberculosis H37Rv. 

Figure 3: Complementation of BCG Pasteur with the RDl region increases 
20 bacterial persistence and pathogenicity in mice. 

(A) Bacteria in the spleen and lungs of BALB/c mice following intravenous (i.v.) 
infection via the lateral tail vein with 10^ colony forming units (cfii) of M tuberculosis 
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H37Rv (black) or 10^ cfu of either BCG::pYUB412 (light grey) or BCG::RD1-I106 
(grey)- 

(B) Bacterial persistence in the spleen and lungs of C57BL/6 mice following i.v. 
infection with 10^ cfii of BCG::pYUB412 (light grey), BCG::RD1-I106 (middle grey) or 

5 BCG::RD 1-2F9 (dark grey). 

(C) Bacterial multiplication after i.v. infection with 10^ cfii of BCG::pYUB412 (light 
grey) and BCG::RD1-2F9 (grey) in severe combined immunodeficiency mice (SCE)). 
For A, B, and C each timepoint is the mean of 3 to 4 mice and the error bars represent 
standard deviations. 

10 (D) Spleens from SCDD mice three weeks after i.v. infection with 10^ cfu of either 
BCG::pYUB412, BCG::RD1-2F9 or BCG::I301 (anJRD3 "knock-m". Fig. IB). The. 
. scale is in cm. 

Figure 4: Immunisation of mice with BCG::RD1 generates marked ESAT-6 specific 
T-cell responses and enhanced protection to a challenge with M tuberculosis. 

15 (A) Proliferative response of splenocytes of C57BL/6 mice immunised subcutaneously 
(s.c.) with 10^ CFU of BCG::pYUB412 (open squares) or BCG::RD1-2F9 (solid 
squares) to in vitro stimulation with various concentrations of synthetic peptides from 
poliovirus type 1 capsid protein VPl, ESAT-6 or Ag85A (K. Huygen, et al.. Infect, 
Immun. 62, 363 (1994), L. Brandt, J .Immunol 157, 3527 (1996) and C. Leclerc et al, J. 

20 r7ro/. 65,711(1991);. 

(B) Proliferation of splenocytes from BCG::RDl-2F9-immunised mice iii the absence or 
pi-eserice of 10 |ig/ml of ESAT-6 1-20 peptide, with or without 1 ng/ml of anti-CD4 
(GK1.5) oir anti-CD8 (H35-17-2) monoclonal antibody. Results are expressed as mean 
and standard deviation of ^H-thymidine incoxporation from duplicate wells. 
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(C) Concentration of IFN-y in culture supematants of splenocytes of C57BL/6 mice 
stimulated for 72 h with peptides or PPD after s.c. or i.v. immunisation with either 
BCG::pYUB412 (middle grey and white) or BCG::RD1-2F9 (light grey and black). 
Mice were inoculated with either 10^ (white and light grey) or 10^(middle grey and 

5 black) cfii. Levels of IFN-y were quantified using a sandwich ELISA (detection limit of 
500 pg/ml) with the mAbs R4-6A2 and biotin-conjugated XMG1.2. Results are 
expressed as the mean and standard deviation of duplicate culture wells. 

(D) Bacterial counts in the spleen and lungs of vaccinated and unvaccinated BALB/c 
mice 2 months after an i.v: challenge with M tuberculosis H37Rv. The mice were 

10 challenged 2 months after i.v. inoculation with 10^ cfii of either BCG::pYUB412 or 
BCG::RD1-2F9. Organ homogenates for bacterial enumeration were plated on 7H11 
medium, with or without hygromycin, to differentiate M tubei'culosis from residual 
BCG colonies. Results are expressed as the mean and standard deviation of 4 to 5 mice 
and the levels of significance derived using the Wilcoxon rang sum test. 

15 

Figure 5: Mycobacterium microti strain OV254 BAG map (BAG clones named 
MiXXX, where XXX is the identification number of the clone), overlaid on the -M 
tuberculosis H37Rv (BAG clones named RvXXX, where XXX is the identification 
number of the clone) and M bovis AF2122/97 (BAG clones named MbXXX, where 
20 XXX is the identification number of the clone) BAG maps. The scale bars indicate the 
position on the M tuberculosis genome. 

Figure 6: Difference in the region 4340-4360 kb between the deletion in BGG RDl*'''^ 
(A) and in M microH RDl'"*'' (C) relative, to M tuberculosis H37Rv (B). 
25 ' ■ • 

Figure 7: Difference in the region 3121-3127 kb between M tuberculosis H37Rv-(A) 
and M microti OV254 (B). Gray boxes picture the direct repeats (PR), black ones the 
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unique numbered spacer sequences. * spacer sequence identical to the one of spacer 58 
reported by van Embden et aL (42). Note that spacers 33-36 and 20-22 are not shown 
because H37Rv lacks these spacers. 

5 Figure 8: A) Asel PFGE profiles of various M microti strains; Hybridization with a 
radiolabeled B) esat-S probe; C) probe of the RDl""'*" flanking region; D) plcA probe. 1. 
M. bo\ns AF2122/97, 2. M canetti, 3. M bovis BCG Pasteur, 4. M tuberculosis H37Rv, 
5. M f7ucroti OV254, 6. M microti Myc 94-2272, 7. M microti B3 type mouse, 8. M 
microti B4 type mouse , 9. M microti B2 type llama, 10. M microti Bl tjpe llama, 11. 

10 M AtCC 35782. M: Low range PFGE marker (NEB). 

Figure 9: PGR products obtained from various M w/crorf strains using primers that 
flank the RDl""^*" region, for amplifying ESAT-6 antigen, that flank the MiD2 region. 1. 
M microti Bl type llama, 2. M microti B4 type mouse, 3. M microti B3 type mouse, 4. 
15 M /72zcror/ B2 type llama, 5. M /Ti/crof/ ATCC 35782, 6. M wwcrarf OV254, 7. M 
microti Myc 94-2272, 8. M tuberculosis H37Rv. 

Figure 10: Map of the M tuberculosis H37Rv RDl genomic region. Map of the 
fragments used to complement BCG and M microti (black) and the genomic regions 

20 deleted from different mycobacterial strains (grey). The middle part shows key genes, 
putative promoters (P) and transcripts, the various proteins from the RDl region, their 
sizes (number of amino acid residues), InterPro domains 
(http://www-ebi.ac.uk/interpro/), membership of M tuberculosis protein families from 
TubercuList (http://genolist.pasteur.fr/TubercuList/). The dashed lines mark the extent of 

25 the RDl deletion in BCG, M microti and M tuberculosis clinical isolate MT56 (Brosch, 
K, et ah A new evolutionary scenario for the Afycohacterium tuberculosis complex. Proc NcdlAcad Sci U 
SA 99, 3684-9. (2002)). M bovis AF2 122/97 is shown because it contains a frameshift 
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mutation in RvSSSl, a gene flanking the RDl region of BCG. The fragments are drawn 
to show their ends in relation to the genetic map, unless they extend beyond the genomic 
region indicated. pRDl-2F9, pRDl-1106 and pAP35 are based on pYUB412; pAP34 on 
pKINT; pAP47 and pAP48 on pSMSl. 

5 Figure 11: Western blot analysis of various RDl knock-ins of M bovis BCG and M. 
jnicrotu The left panel shows results of immunodetection of ESAT-6, CFP-10 and 
PPE68 (Rv3873) in whole cell lysates (WCL) and culture supematants of BCG; the 
centre panel displays the equivalent findings from M microti and the right panel 
contains M tuberculosis H37Rv control samples. Samples from mycobacteria 

10 transformed with the following plasmids were present in lanes: pYlJB412 vector 
control; 1, pAP34; 2, pAP35; 3, RD1-I106; 4, RD1-2F9. The positions of the nearest 
molecular weight markers are indicated- 
Figure 12: Analysis of immune responses induced by BCG recombinants. A, The upper 
three panels display the results of splenocyte proliferation assays in response to 

15 stimulation in vitro with a peptide from MalE (negative control), to PPD or to a peptide 
containing an immunodominant CD4-epitope from ESAT-6. B, The lower panel shows 
IFN-y production by splenocytes in response to the same antigens. Symbols indicate the 
nature of the various BCG transformants. Samples were taken from C57BL/6 mice . 
immunised subcutaneously. 

20 Figure 13: Further immunological characterization of responses to BCG::RD1-2F9 A, 
Proliferative response of splenocytes of C57BL/6 mice immunised subcutaneously (s.c.) 
with 10^ CFU of BCG::pYUB412 or BCG::RD1-2F9 to in vitro stimulation with various 
concentrations of synthetic peptides from poliovirus type 1 capsid protein VPl (negative 
control), ESAT-6 or Ag85A (see Methods for details). B, Proliferation of splenocytes 

25 from BCG::RDl-2F9-immunised mice in the absence or presence of ESAT-6 1-20 
peptide, with or without anti-CD4 or ahti-CD8 monoclonal antibody. Results are 
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expressed as mean and standard deviation of ^H-thymidine incorporation from duplicate 
. wells, c. Concentration of IFN~y in culture supematants of splenocytes of C57BL/6 mice 
stimulated for 72 h with peptides or PPD after s.c. or i.v. immunisation with either 
BCG::pYUB412 or BCG::RD1-2F9- Mice were inoculated with either 10^ or lO'^ CPU. 
5 Results are expressed as the mean and standard deviation of duplicate culture wells. 

Figure 14: Mouse protection studies. A, Bacterial counts in the spleen and lungs of 
vaccinated and unvaccinated C57BL/6 mice 2 months after an i.v. challenge with M 
tuberculosis H37Rv. The mice were challenged 2 months after i.v. inoculation with 10^ 
cfii of either BCG::pYUB412 or BCG::RD1-2F9. Organ homogenates for bacterial 

10 enumeration were plated on 7H11 medium, with or without hygromycin, to differentiate 
M tuberculosis from residual BCG colonies. Results are expressed as the mean and 
standard deviation of 4 mice. Hatched columns correspond to the cohort of imvaccinated 
mice, while white and black columns correspond to mice vaccinated with 
BCG::pYUB412 and BCG::RD1-2F9, respectively. B, Bacterial counts in the spleen and 

15 lungs of vaccinated and unvaccinated C57BL6 mice after an aerosol challenge with 1000 
CPUs of M tuberculosis. All mice were treated with antibiotics for three weeks prior to 
infection with M tuberculosis. Data are the mean and SE measured on groups of three 
animals, and differences between groups were analysed using ANOVA (*p<0.05, 
**p<0.01). 

20 Figure 15: Guinea pig protection studies. A, Mean weight gain of vaccinated and 
unvaccinated guinea pigs following aerosol infection with M tuberculosis H37Rv. 
Guinea pigs were vaccinated with either saline (triangles), BCG (squares) or BCG::RD1- 
2P9 (filled circles). The error bars are the standard error of the mean. Each time point 
represents the mean weight of six guinea pigs. Por the saline vaccinated group the last 

25 live weight was used for calculating the means as the animals were killed on signs of 
severe tuberculosis which occurred after 50, 59, 71, 72, 93 and 93 days. B, Mean 
bacterial counts in the spleen and lungs of vaccinated and unvaccinated guinea pigs after 
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an aerosol challenge with M tuberculosis H37Rv. Groups of 6 guinea pigs were 
vaccinated subcutaneously with either saline, BCG or BCG::RD1-2F9 and infected 56 
days later. Vaccinated animals were killed 120 days following infection and 
unvaccinated ones on signs of suffering or significant weight loss. The error bars 
5 represent the standard error of the rriean of six observations. C, Spleens of vaccinated 
guinea pigs 120 days after infection with M tuberculosis H37Rv; left, animal 
immunised with BCG; right, animal immunised with BCG::RD1-2F9. 

Figure 16: Diagram of the M.tuberculosis H37Rv genomic region showing a working 
model for biogenesis and export of ESAT-6 proteins. It presents a possible fimctional 

10 model indicating predicted subcellular localization and potential interactions within the 
mycobacterial cell envelope. RosettaL stone analysis indicates direct interaction between 
proteins Ry3870 and Rv3871,and the sequence similarity between the N-terminal 
domains of Rv3868 and Rv3876 suggests that these putative chaperones might also 
interact. Rv3868 is a member of the AAA-family of ATPases that perform chaperone- 

15 like fimctions by assisting in the assembly, and disassembly of protein complexes 
(Neuwald, A.F., Aravind, L., Spouge, J.L. & Koonin, E.V. AAA+: A class of chaperone- 
like ATPases associated with the assembly, operation, and disassembly of protein 
complexes. Genome Res 9, 27-43. (1999).). It is striking that many type m secretion 
systems require chaperones for stabilisation of the effector proteins that they secrete and 

20 for prevention of premature protein-protein interactions (Page, A.L. & Parsot, C. 
Qiaperones of the type in secretion pathway: jacks of all trades. Mol Microbiol 46, 1- 11 . 
(2002).). Thus, Rv3868, and possibly Rv3876,may be required for the folding and/or 
dimerisation of ESAT-6/CFP-10 proteins (Renshaw, P.S., et al Conclusive evidence 
that the major T-cell antigens of the M tuberculosis complex ESAT-6 and CFP-10 form 

25 a tight, 1 : 1 complex and characterisation of the structural properties of ESAT-6, CFP-10 
and the ESAT-6-CFP-10 complex: implications for pathogenesis and virulence. J Biol 
Chem 8, 8 (2002).),or even to prevent premature dimerisation. ESAT-6/CFP-10 are 
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predicted to be exported through a transmembrane channel,consisting of at least Rv3870, 
Rv3871, and Rv3877,and possibly Rv3869, in a process catalysed by ATP-hydroIysis. 
Rv3873 (PPE 68) is known to occur in the cell envelope and may also be involved as 
shown herein. 

5 

Example 1: preparation and assessment of M bovis BCG::RD1 strains as a vaccine 
for treating or preventing tuberculosis. 

As mentioned above, we have found that complementation with RDl was accompanied 
by a change in colonial appearance as the BCG Pasteur "knock-in" strains developed a 

10 strikingly different morphotype (Fig. 2A). The RDl complemented strains adopted a 
spreading, less-rugose morphology, that is characteristic of M bovis, and this was more 
apparent when the colonies were inspected by light microscopy (Fig. 2B). Maps of the 
clones used are shown (Fig. IC). These changes were seen following complementation 
with all of the RDl constructs (Fig. IC) and on complementing M microti (data not 

15 shown). Pertinently, Calmette and Guerin (A. Cabnette, La vaccination preventive 
contre la tuberculose. (Masson et cie., Paris, 1927P observed a change in colony 
morphology during tiieir initial passaging of M bovis, and our experiments now 
demonstrate that this change, corresponding to loss of RDl, directly contributed to 
attenuating this virulent strain. The integrity of the cell wall is known to be a key 

20 virulence determinant for M tuberculosis (C, E. Barry, Trends Microbiol 9, 237 (200 ip, 
and changes in both cell wall lipids (M, S. Glickman, J. S. Cox, W. R. Jacobs, Jr., Mol 
Cell 5, 717 (2000); and protem (F. X. Berthet, et al. Science 282, 759 (1998); have been 
shown to siter colony inorphology and diminish persistence in animal models. 

To determine which genes were implicated in these morphological changes, antibodies 
25 recognising three RDl proteins (Rv3873, CFPIO and ESAT-6) were used in 
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immimocytological and subcellular fractionation analysis. When the different cell 
fractions from M tuberculosis were immunoblotted all three proteins were localized in 
the cell wall fraction (Fig. 2C) though significant quantities of Rv3873, a PPE protein, 
were also detected in the membrane and cytosolic fractions (Fig. 2D). Using 
5 immunogold staining and electron hiicroscopy the presence of ESAT-6 in the envelope 
of M tuberculosis was confirmed but no alteration in capsular ultrastructure could be 
detected (data not shown). Previously, CFP-10 and ESAT-6 have been considered as 
secreted proteins (P. X. Berthet et al. Microbiology 144, 3195 (1998)^ but our results 
suggest that their biological functions are linked directly with the cell wall. 

10 Changes in colonial morphology are often accompanied by altered bacterial virulence. 
Initial assessment of the growth of different BCG::RD1 "knock-ins" in C57BL/6 or 
BALB/c mice following intravenous infection revealed that complementation did not 
restore levels of virulence to those of the reference strain M tuberculosis H37Rv (Fig. 
3A). In longer-term experiments, modest yet significant differences were detected in the 

15 persistence of the BCG::RD1 "knock-ins" in comparison to BCG controls. Following 
intravenous infection of C57BL/6 mice, only the RDl "knock-ins" were still detectable 
in the lungs after 106 days (Fig. 3B). This difference in virulence between the RDl 
recombinants and the BCG vector control was more pronounced in severe combined 
immunodeficiency (SCID) mice (Fig. 3C). The BCG::RD1-2F9 "knock-in" was 

20 markedly more virulent, as evidenced by the growth rate in lungs and spleen and also by 
an increased degree of splenomegaly (Fig. 3D). Cytological examination revealed 
numerous bacilli, extensive cellular infiltration and granuloma formation. These 
increases in virulence following complementation with the RDl region, demonstrate that 
the loss of this genomic locus contributed to the attenuation of BCG. 

25 The inability to restore fiill virulence to BCG Pasteur was not due to instability of our 
constructs nor to the strain used (data not shown). Essentially identical results were 
obtained on coraplementmg BCG Russia, a strain less passaged than BCG Pasteur and 
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presumed, therefore, to be closer to the original ancestor (M. A. Behr, et al.^ Science 284, 
1520 (1999X>. This indicates that the attenuation of BCG was a polymutational process 
and loss of residual virulence for animals was documented in, the late 1920s (T. 
Oettinger, et al. Tuber LimgDis 79, 243 (1999)/ Using the same experimental strategy, 
5 we also tested the effects of complementing with RD3-5, RD7 and RD9 ^. T. Cole, et 
al. Nature 393, 537 (1998) ; M. A. Behr, et al. Science 284, 1520 (1999) ; R. Brosch, et 
al, Ifrfection Immun, 66, 2221 (1998) and S. V. Gordon et al, Molec Microbiol 32, 643 

(1999) ) encoding putative virulence factors (Fig. IB). Reintroduction of these regions, 
which are not restricted to avirulent strains, did not affect virulence in immuno- 

10 competent mice. Although it is possible that deletion effects act synergistically it seems 
more plausible that other attenuating mechanisms are at play. 

Since RDl encodes at least two potent T-cell antigens (K. Colangelli, et ah^ Infect. 
Immim, 68^ 990 (2000), M. Harboe, et al.. Infect Immun. 66, 717 (1998) and R. L. V. 
Skj0t, et al. Infect Immun. 68, 214 (2000)), we investigated whether its restoration 

15 induced immune responses to these antigens or even improved the protective capacity of 
BCG. Three weeks following either intravenous or subcutaneous inoculation with 
BCG::RD1 or BCG controls, we observed similar proliferation of splenocytes to an 
Ag85A (an antigenic BCG protein) peptide (K. Huygen, et aL, Infect Immun, 62, 363 
(1994)^, but not against a control viral peptide (Fig, 4A). Moreover, BCG::RD1 

20 generated powerful CiD4^ T-cell responses against the ESAT-6 peptide as shown by 
splenocyte proliferation (Fig. 4A, B) and strong IFN-y production (Fig. 4C). In contrast, 
the BCG::pYUB412 control did not stimulate ESAT-6 specific T-cell responses ttius 
indicating that these were mediated by the RDl locus* ESAT-6 is, therefore, highly 
immunogenic in mice in the context of recombinant BCG. 

25 When used as a subunit vaccine, ESAT-6 elicits T-cell responses aiid induces levels of 
protection weaker tiian btit akiii to those of BCG (L. Brandt et al. Infect Immtm. 68, 791 

(2000) ^. Challenge experiments were conducted to determine if induction of immune . 
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responses to BCG::RDl>encoded antigens, such as ESAT-6, could improve protection 
against infection vvith M tuberculosis. Groups of mice inoculated with either 
BCG::pYUB412 or BCG::RD1 w^e subsequently infected intravenously with M 
tuberculosis H37Rv. These experiments showed that immunisation with the BCG::RD1 
5 "knock-in" inhibited the growth of M tuberculosis within both BALB/c (Fig- 4D) and 
C57BL/6 mice when compared to inoculation with BCG alone. 

Although the increases in protection induced by BCG::RD1 and the BCG control are 
modest they demonstrate convincmgly that genetic differences have developed between 
the live vaccine and the pathogen which have weakened tlie protective capacity of BCG. 

10 This study therefore defines the genetic basis of a compromise that has occurred, during 
the attenuation process, between loss of virulence and reduced protection (M. A. Behr, P. 
M. Small, Nature 389, 133 (1997P: The strategy of reintroducing, or even 
overproducing (M. A. Horwitz et al, Proa Natl Acad Sci USA91, 13853 (2000);, the 
missing immunodominant antigens of M tuberculosis in BCG, could be combined with 

15 an immuno-neutral attenuating mutation to create a more efficacious tuberculosis 
vaccine. 

Example 2: BAC based comparative genomics XAen^Ties Mycobacterium microti as a 
natural ESAT«6 deletion mutant 

20 

We searched for any genetic differences between human and vole isolates that might 
explain their different degree of virulence and host preference and what makes the vole 
isolates harmless for humans. In this regard, comparative genomics methods were 
employed in connection with the present invention to identify major differences that may 
25 exist between the M microti reference strain OV254 and the entirely sequenced strains 
of M tuberculosis H37Rv (10) or M bovis AF2122/97 (14). An ordered Bacterial 
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Artificial Chromosome (BAC) library of M microti OV254 was constructed and 
individual BAC to BAC comparison of a minimal set of these clones with BAC clones 
from previously constructed libraries of M tuberculosis H37Rv and M bovis AF2 122/97 
was undertaken. 

5 Ten regions were detected in M. microti that were different to the corresponding 
genomic regions in M. tuberculosis and M bovis. To investigate if these regions were 
associated with the ability of M microti strains to infect humans, their genetic 
organization was studied in 8 additional M microti strains^ including those isolated 
recently from patients with pulmonary tuberculosis. This analysis identified some 
10 regions that were specifically absent from all tested M microti strains, but present in all 
other members of the M tuberculosis complex and other regions that were only absent 
from vole isolates of M microti. 

2.1 MATERIALS AND METHODS 

Bacterial strains and plasmids. M microti OV254 which was originally isolated from 
voles in the UK in the 1930*s was kindly supplied by MJ Colston (45). DNA from M 
microti OV2I6 and OV183 were included in a set of strains used during a multicenter 
study (26). M. microti Myc 94-2272 was isolated in 1988 from the perfiision fluid of a 

20 41 -year-old dialysis patient (43) and was kindly provided by L. M. Parsons. M, microti 
35782 was purchased from American Type Culture Collection (designation TMC 1608 
(M.P. Prague)). M, microti Bl type llama, B2 type llama, B3 type mouse and B4 type 
niouse were obtained from the collection of the National Reference Center for 
Mycobacteria, Forschungszentrum Borstel, Germany. M bovis strain AF2 122/97, 

25 spoligotype 9 was responsible for a herd outbreak in Devon in the UK and has been 
isolated from iesions in both cattle and badgers. Typically, mycobacteria were grown on 
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7H9 Middlebrook liquid medium (Difco) containing 10% oleic-acid-dextrose-catalase 
(Difco), 0.2 % pyruvic acid and 0.05% Tween 80. 

Library construction, preparation of BAC DNA and sequencing reactions. 
5 Preparation of agarose-embedded genomic DNA from M microti strain OV254, M. 
tuberculosis H37Rv, M bovis BCG was performed as described by Brosch et al. (5). The 
M, microti library was constructed by ligation of partially digested HindBI fragments 
(5D-125 kb) into pBeloBAClL From the 10,000 clones that were obtained, 2,000 were 
picked into 96 well plates and stored at -SO^'G. Plasmid preparations of recombinant 

10 clones for sequencing reactions were obtained by pooling eight copies of 96 well plates, 
with each well containing an overnight culture in 250 (xl 2YT rriedium with 12.5 [ig.mr^ 
chloramphenicoL After 5 min centrifugation at 3000 rpm^ the bacterial pellets were 
resuspended in 25 \il of solution A (25 mM Tris, pH 8.0, 50 mM glucose and 10 mM 
EDTA), cells were lysed by adding 25 ^1 of buffer B (NaOH 0.2 M, SDS 0.2%). Then 

15 20 \il of cold 3 M sodium acetate pH 4,8 were added and kept on ice for 30 min. After 
centriftigation at 3000 rpm for 30 min, the pooled supematants (140 ^il) were transferred 
to new plates. 130 jil of isopropanol were added, and after 30 min on ice, DNA was 
pelleted by centriftigation at 3500 rpm for 15 min. The supernatant was discarded and 
the pellet resuspended in 50 p,l of a 10 |ig/ml RNAse A splution (in Tris 10 mM pH 7.5 / 

20 EDTA 10 mM) and incubated at 64®C for 15 min. After precipitation (2.5 p.1 of sodium 
acetate 3 MpH 7 and 200 \i\ of absolute ethanol) pellets were rinsed with 200 ]il of 70% 
ethanol, air dried and finally suspended in 20 |il of TE bufier. 

End-sequencing reactions were performed with a Tag DyeDeoxy Terminator cycle 
25 sequencing kit (Applied Biosystems) using a mixture of 13 |il of DNA solution, 2 jil of 
Primer (2 iiM) (SP6-BAG1, AGTTAGCTCACTCATTAGGCA (SEQ ID No 15), or T7- 
BACl, GGATGTGCTGCAAGGCGATTA (SEQ ID No 16)), 2.5 jil of Big Dye and 2.5 
III of a 5X buffer (50 mM MgCl2, 50 mM Tris). Thermal cycling was performed on a 
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PTC-100 amplifier (MJ Inc.) with an initial denaturation step of 60 s at 95°C, followed 
by 90 cycles of 15 s at 95°C, 15 s at 56^0, 4 min at 60''C. DNA was then precipitated 
with 80 \il of 76% ethanol and centrifiiged at 3000 rpm for 30 min. After discarding the 
supernatant, DNA was finally rinsed with 80 iil of 70% ethanol and resuspended in 

5 appropriate buffers depending on the type of automated sequencer used (ABI 377 or ABI 
3700). Sequence data were transferred to Digital workstations and edited using the TED 
software jfrom the Staden package (37). Edited sequences were compared against the M 
tuberculosis H37Rv database (http://genolist.pasteur.fr/TubercuList/), the M bovis 
BLAST server (http://www.sanger.ac.Uk/Proiects/M bovis/blast server.shtml \ and in- 

10 house databases to determine the relative positions of the M microti OV254 BAG end- 
sequences. 

Preparation of BAG DNA from recombinants and BAG digestion profile 
comparison. DNA for digestion was prepared as previously described (4). DNA (1 |ig) 

15 was digested with HindUl (Boehringer) and restriction products separated by pulsed-field 
gel electrophoresis (PFGE) on a Biorad CHEF-DR HI system using a 1% (w/v) agarose 
gel and a pulse of 3.5 s for 17 h at 6 V.cm*^ Low-range PFGE markers (NEB) were used 
as size standards. Insert sizes were estimated after ethidium bromide stainmg and 
visualization with UV light Different comparisons were made with overlapping clones 

20 fi-om the M microti OV254, M bovis AF2122/97, and M tuberculosis H37Rv 
pBeloBACU libraries. 

PGR analysis to determine presence of genes in different M. microti strains* 
Reactions contained 5 ^il of lOxPCR buffer (100 mM B-mercaptoethanoI, 600 mM Tris- 
25 HCl, pH 8.8, 20 mM MgCk, 170 mM (NH4)2S04, 20 mM nucleotide mix dNTP), 2.5 ^il 
of each primer at 2 \iM, 10 ng of template DNA, 10% DMSO and 0.5 unit pf Taq 
polymerase in a final volume of 12.5 |a1. Thermal cycling was performed on a PTC-100 
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amplifier (MJ Inc.) with an initial denaturation step of 90 s at 95''C, followed by 35 
cycles of 45 s at 95°C, 1 min at 60^C and 2 min at 72*^0. 

RFLP analysis. In brief, agarose plugs of genomic DNA prepared as previously 

5 described (5) were digested with either Asel Dral or Xba\ (NEB), then electrophoresed 
on a 1% agarose gel, and finally triansfen'ed to Hybond-C extra nitrocellulose membranes 
(Amersham). Different probes were amplified by PGR firom the M microti strain OV254 
or M tuberculosis H37Rv using primers for : 
esat-6 (esat-6F GTCACGTCCATTCATTCCCT (SEQ ID No 17); 

10 esat-6RATCCCAGTGACGTTGCCTT)(SEQIDN6 18), 

the RDl'"^^ flanking region (4340, 209F GCAGTGCAAAGGTGCAGATA (SEQ ID No 
19); 4354,701R GATTGAGACACTTGCCACGA (SEQ ID No 20)), or 
plcA (pIcA.intF CAAGTTGGGTCTGGTCGAAT (SEQ ID - No 21); plcA.int.R 
GCTAGCCAAGGTCTCCTGGT (SEQ ID No 22)). Amplification products were radio- 

15 labeled by using the Stratagene Prime-It n kit (Stratagene). Hybridizations were 
performed at 65^C in a solution containing NaCl 0.8 M, EDTA pH 8, 5 inM, sodium 
phosphate 50 mM pH 8, 2% SDS, IX Denhardfs reagent and 100 ng/ml salmon sperm 
DNA (Genaxis). Membranes were exposed to phosphorimager screens and images were 
digitalized by using a STORM phospho-iriiager. 

20 DNA sequence accession numbers* The nucleotide sequences that flank MiDl, MiD2, 
MiD3 as well as the junction sequence of RDl""*^ have been deposited in the EMBL 
database. Accession numbers are AJ345005, AJ345006, AJ315556 and AJ315557, 
respectively. 

25 1.2 RESULTS 

Establishment of a complete ordered BAC libraiy of M microti OV254. 
Electroporation of pBeloBACll containing partial Hindm digests of M microti OV254 
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DNA into Escherichia coli DHIOB yielded about 10,000 recombinant clones, from 
which 2,000 were isolated and stored in 96-well plates. Using the complete sequence of 
the M tuberculosis H37Rv genome as a scaffold, end-sequencing of 384 randomly 
chosen M microti BAG clones allowed us to select enougli clones to cover almost all of 

5 the 4.4 Mb chromosome. A few rare clones that spanned regions that were not covered 
by this approach were identified by PGR screening of pools as previously described (4). 
This resulted in a minimal set of 50 BACs, covering over 99.9% of the M microti 
OV254 genome, whose positions relative to M. tuberculosis H37Rv are shown in Figure 
5. The insert size ranged between 50 and 125 kb, and the recombinant clones were 

10 stable. Compared with otlier BAG libraries jfrom tubercle bacilli (4, 13) the M, microti 
OV254 BAG library contained clones that were generally larger than those obtained 
previously, which facilitated the comparative genomics approach, described below. 

Identification of DNA deletions in M microti OV254 relative to M tuberculosis 
15 H37Ry by comparative genomics. The minimal overlapping set of 50 BAG clones, 
together with the availability of three other ordered BAG libraries from M tuberculosis 
H37Rv, M bovis BGG Pasteur 1173P2 (5, 13) and M bovis AF2122/97 (14) allowed us 
to carry out direct BAG to BAG comparison of clones spanning the same genomic 
regions. Size differences of PFGE-separated HindUl restriction fragments from M 
. 20 microti OV254 BAGs, relatiye to restriction fragments from M. bovis and/or Mi 
tuberculosis BAG clones, identified loci that differed among the tested strains. Size 
variations of at least 2 kb were easily detectable and 10 deleted regions, evenly 
distributed around the genome, and containing more than 60 open reading frames 
(ORFs), were identified. These regions represent over 60 kb that are missing from M 
25 microti OV254 strain compared to M tuberculosis H37Rv. First, it was found that 
phiRv2 (RDll), one of the two M. tuberculosis YGlKv prophages was present in M 
microti OV254, whereas phiRvl, also referred to as RD3 (29) was absent. Second, it was 
. found that M microti lacks four of the genomic regions that were also absent from M 
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bovis BCG. In fact, these four regions of difference named RD7, RD8, RD9 and RDIO 
are absent from all members of the M tuberculosis complex with the exception of M 
tuberculosis and M canetti^ and seem to have been lost from a common progenitor 
strain of M africaimm, M microti and M bovis (3). As such, our. results obtained with 

5 individual BAG to BAG comparisons show that M microti is part of this non-M 
tuberculosis lineage of the tubercle bacilli, and this assumption was further confirmed by 
sequenciiig the junction regions of RD7 - RDIO in M microti OV254. The sequences 
obtained were identical to those from M q/ricahum, M bovis and M bovis BCG strains, 
Apart from^ these foiir conserved regions of difference, and phiRvl (RD3) M microti 

10 OV254 did not show any other RDs with identical junction regions to M bovis BCG 
Pasteur, which misses at least 17 RDs relative to M tuberculosis H37Rv (1, 13, 35). 
However, five other regions missing from the genome of M microti OV254 relative to 
M tuberculosis H37Rv were identified (RDr^ RD5"^^ MiDl, MiD2, MiD3). Such 
regions are specific either for strain OV254 or for M microti strains in general. 

15 Interestingly, two of these regions, RDl™^ RDS"^^ partially overlap RDs from the M 
bovisBCG. ' . 

Antigens iESAT-6 and CFP-10 are absent from M. microti. One of the most 
interesting findings of the BAG to BAG comparison was a novel deletion in a genomic 

20 region close to the origin of replication (figure 5). Detailed PGR and sequence analysis 
of this region in M microti OV254 showed a segment of 14 kb to be missing (equivalent 
to M tuberculosis H37Rv from 4340,4 to 4354,5 kb) that partly overlapped RDl^^^ 
absent from M bovis BCG. More precisely, ORFs Rv3864 and Rv3876 are truncated in 
M microti OV254 and ORFs RV3865 to Rv3875 are absent (figure 6). This observation 

25 is particularly interesting as previous comparative genomic analysis identified RDl^''^ as 
the only RD Vegion that is specifically absent from all BCG sub-strains but present in all 
other members of the M tuberculosis complex (1, 4, 13^ 29, 35). As shown in Figure 6, 
in M. microti OV254 the RDl""'*" deletion is responsible for the loss of a large portion of 
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the conserved ESAT-6 family core region (40) including the genes coding for the major 
T-cell antigens ESAT-6 and CFP-10 (2, 15). The fact that previous deletion screening 
protocols employed primer sequences that were designed for the right hand portion of 
the RDl^^ region (i.e. gene Rv3878) (6, 39) explains why the RDl"^^ deletion was not 
5 detected earlier by these investigations. Figure 6 shows that RDl'"^*^ does not afifect genes 
Rv3877, Rv3878 and Rv3879 which are part of the RDl ^""^ deletion. 

Deletion of phosphoIipase-C genes in M. fnicroti OV254. RDS*"'*^, the other region 
absent from M, microti OV254, that partially overlapped an RD region from BCG, was 
10 revealed by comparison of BAG clone Mil8A5 with BAG Rvl43 (figure 5). PGR 
analysis and sequencing of the junction region revealed that RD5"'*^ was smaller than the 
RD5 deletion in BGG (Table 2 and 3 below). 

TABLE2 

1 5 Description of the putative function of the deleted and truncated ORFs in M microti OV254 



Region 


Start - End 


overlapping ORF 


Putative Function or family 


RDIO 


264,5-266,5 


Rv0221-Rv0223 


echAl 


RD3 


1779,5-1788,5 


Rvl573-Rvl586 


bacteriophage proteins 


RD7 


2207,5-2220,5 


Rvl964-Rvl977 


yrbESA'SB; mceSA-F; unknown 


RD9 


2330-2332 


Rv2072-Rv2075 


cobL; probable oxidoreductase; unkno^vn 




2627,6-2633,4 


Rv2348-Rv2352 


pic A-C; member of PPE family 


MiDl 


3121,8-3126,6 


Rv2816-Rv2819 


IS5/7(?tTansposase; unknown 


MiD2 


3554,0-3755,2 


Rv3187-Rv3190 


IS(5i70transposase; unknown 


MiD3 


3741,1-3755,7 


Rv3345-Rv3349 


members of the PE-PGRS and PPE families; insertion 








elements - 


RD8 


4056,8-4062.7 


Rv3617.R:v3618 


ephA; IpqG; member of the PE-PGRS femily 



wo 03/085098 



PCT/IB03/01789 



42 

RD1°^ 4340,4-4354,5 Rv3864-Rv3876 member of the CBXX/CF QX family; member of the PE 

and PPE famiUes; ESAT-6; CFPIO; unknown 



5 TABLE 3. Sequence at the junction of the deleted regions in M. microti OV254 



Junction Position ORFs 



Sequences at the junction 



Flanking primers 



RDr" 4340,421- Rv3864. 
(SEQ ED 4354,533 Rv3876 
No 23) 



CAAGACGAGGTTGTAAAACCTCGACG 

CAGGATCGGCGATGAAATGCCAGTCG 

GCGTCGCTGAGCGCGCGCTGCG CCG^ 

GTCCCATTITGTCGCTGATITGTJTGAACA 

GCGACGAACCGGTGTTGAAAATGTCGCCT 

GGGTCGGGGATTCCCT 



4340,209F (SEQ ID No 19) 
GCAGTGCAAAGGTGCAGATA 
4354,70 IR (SEQ ID No 20) 
GATTGAGACACTTGCCACGA 



RD5°^° 2627,831- Rv2349- 
(SEQ ID 2635,581 Rv2355 
No 26) 



CCTCGATGAACCACCTGACATGACCC 
CATCCTTTCCAAGAACTGGAGTCTCC 
GGACATGCCGGGGCGGTTCACTGCCC 
CAGGTGTCCTGGGTCGTTCCGTTGACCGT 
CGAGTCCGAACATCCGTCATTCCCGGTGG 
CAGTCGGTGCGGTGAC 



2627,370F (SEQ ID No 24) 
GAATGGCGACGTCATATCG 
2633,692R (SEQ ID No 25) 
CGGCCACTGAGTTCGATTAT 



MiDl 3121,880. Rv2815c- 
(SEQ ID 3126,684 Rv2818c 

No 29) 



CACCTGACATGACCCCATCCTTTCCA 

AGAACTGGAGTCTCCGGACATGCCGG 

GGCGGTTCAGGGACATTCATGTCCATCIT 

CTGGCAGATCAGCAGATCGCTTGTTCTCAG 

TGCAGGTGAGTC 



3 121 ,690F (SEQ ID No 27) 
CAGCCAACACCAAGTAGACG 
3 126,924R (SEQ ID No 28) 
TCTACCTGCAGTCGCTTGTG 



MiD2 3554,066- Rv3188- 
(SEQ ID 3555,259 Rv3189 
No 32) 



GCTGCCTACTACGCTCAACGCCAGAG 
ACCAGCCGCCGGCTGAGGTCTCAGAT 
CAGAGAGTCTCCGGACTCACCGGGGC 
GGTLCATAAAGGCTTCGAGACCGGACGG 
GCTGTAGGTTCCrCAACTGTGTGGCGGAT 
GGTCTGAGCACTTAAC 



3553,880F (SEQ ID No 30) 
GTCCATCGAGGATGTCGAGT 
3555,385R (SEQ ID No 31) 
CTAGGCCATTCCGTTGTCTG 



MiD3 3741,139- Rv3345c- 
(SEQ ID 3755,777 Rv3349c 
No 35) 



TGGCGCCGGCACCTCCGTTGCCACCG 

TTGCCGCCGCTGGTGGGCGCGGTGCC 

GTTCGCCCCGGCCGAACCGtTCAGGG 

CCGGGTTCGCCCTCAGCCGCTAAACACG 

CCGACCAAGATCAACGAGCTACCTGCCCG 

GTCAAGGTTGAAGAGCCCCCATATCAGCA 

AGGGCCCGGTGTCGGCG 



3740,950F (SEQ ID No 33) 
GGCGACGCCATTTCC 
3755,988R (SEQ ID No 34) 
AACTGTCGGGCnTGCTCTT 



In fact, M. microti pV254 lacks the genes plcA, plcB, plcC and one specific PPE-protein 
encoding gene (Rv2352). This was confirmed by the absence of a clear band on a 
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Southern blot ofAsel digested genomic DNA from M microti OV254 hybridized with a 
pIcA probe. However, the genes Rv2346c and Rv2347c, members of the esat'6 family, 
and Rv2348c, that are missing from M, bovis and BCG strains (3) are still present in M 
microti OV254. The presence of an IS6110 element in this segment suggests that 
5 recombination between two 1S6110 elements could have been involved in the loss of 
RDS'"*^ and this is supported by the finding that the remaining copy of IS61 10 does not 
show a 3 base-pair direct repeat in strain OV254 (Table 3). 

Lack of MiDl provides genomic clue for M. microti OV254. characteristic 

10 spoligotype. MiDl encompasses the three ORFs Rv2816, Rv2817 and Rv2818 that 
encode putative proteins whose ftmctions are yet unknown, and has occurred in the 
direct repeat region (DR), a polymorphic locus in the genomes of the tubercle bacilli that 
contains a cluster of direct repeats of 36 bp, separated by unique spacer sequences of 36 
to 41 bp (17), (figure 7). The presence or absence of 43 unique spacer sequences that 

15 intercalate the DR sequences is the basis of spacer-oligo tj'ping, a powerful typing 
method for strains from the M tuberculosis complex (23). M microti isolates exhibit a 
charjacteristic spoligotype with an unusually small DR cluster, due to the presence of 
only spacers 37 and 38 (43). In M microti OV254, the absence of spacers 1 to 36, which 
are present in many other M tuberculosis complex strains, appears to result from an* 

20 IS6110 mediated deletion of 636 bp of the DR region. Amplification and P\mTl 
restriction analysis of a 2.8 kb fragment obtained with primers located in the genes that 
flank the DR region (Rv2813c and Rv2819) showed that there is only one copy of 
IS6110 remaining in this region (figure 7). This 1S6110 element is inserted into ORF 
Rv2819 at position 3,119,932 relative to the M tuberculosis H37Rv genome. As for 

25 Other IS6110 elements that result from homologous recombination between two copies 
(7), no 3 base-pair direct repeat was found for this copy of IS6J10 in the DR region. 
Concerning the absence of spacers 39-43 (figure7), it was found that M. microti showed 
a slightly different organization of this locus than M bovis strains, which also 



wo 03/085098 PCT/IB03/01789 



44 

characteristically lack spacers 39-43. In M microti OV254 an extra spacer of 36 bp was 
found that was not present in M bovis nor in M tuberculosis H37Rv. The sequence of 
this specific spacer was identical to that of spacer 58 reported by van Embden and 
colleagues (42). In their study of the DR region in many strains from the M. tuberculosis 
5 complex this spacer was only found in M microti strain NLA000016240 (AF189828) 
and in some ancestral M. tuberculosis strains (3, 42). Like MiDl, MiD2 most probably 
results from an IS6770-mediated deletion of two genes (Rv3188, Rv3189) that encode 
putative proteins whose ftinction is unknown (Table 3 above and Table 4 below). 

Iff ABLE 4. Presence of the RD and MiD regions in different M. microti strains 



. HOST VOLES HUMAN 



Strain 


OV2S4 


OV183 


OV216 


ATCC 


Myc 94 B3 


iB4 type 


Bl 


B2 










. 35782 


-2272 . 


type mouse 


mouse 


type llama 


type llama 




absent 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


RD3 


absent 


absent 


absent 


absent 


absent 


. absent 


absent 


absent 


absent 


RD7 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


RD8 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


RD9 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


RDIO 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


absent 


MiD3 


absent 


ND 


ND 


absent 


absent 


absent 


absent 


absent 


absent 


MiDl 


absent 


ND 


ND 


present 


partial 


partial 


partial 


present 


present 




absent 


absent 


absent 


present 


present 


present 


present 


preseiit 


present 


MiD2 


absent 


ND 


ND 


present 


present 


present 


present 


present 


present 



ND, not determined 
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Absence of some members of the PPE family in M. microti. MiD3 was identified by 
the absence of two Hindm sites in BAG Mi4B9 that exist at positions 3749 kb and 3754 
kb in the M tuberculosis H37Rv chromosome. By PGR and sequence analysis, it was 
determined that MiDS corresponds to a 12 kb deletion that has truncated or removed five 

5 genes orthologous to Rv3345c-Rv3349c, Rv3347c encodes a protein of 3157 amino- 
acids that belongs to the PPE family and Rv3346c a conserved protein that is also 
present its M leprae. The function of both these putative proteins is unknown while 
Rv3348 and Rv3349 are part of an insertion element (Table 2). At present, the 
consequences of the MiDS deletions for the biology of M microti remains entirely 

10 unknovm- 

Extra-DNA in M microti Oy254 relative to M. tuberculosis H37Rv. M. microti 
OV254 possesses the 6 regions RvDl to RvD5 and TBDl that are absent from the 
sequenced strain M tuberculosis H37Rv, but which have been shown to be present in 

15 other members of the M tuberculosis complex, like M canettii, M. qfricanum, M ba\ns, 
and M bovis BGG (3, 7, 13). In M tuberculosis H37Rv, four of these regions (RvD2-5) 
contain a copy of IS6110 which is not flanked by a direct repeat, suggesting that 
recombination of two 1S6110 elements was involved in the deletion of the intervening 
genomic regions (7). In consequence, it seems plausible that these regions were deleted 

20 from the M tuberculosis H37Rv genome rather than specifically acquired by M microti. 
In addition, three other small insertions have also been found and they are due to the 
presence of an 186110 element in a different location than in M tuberculosis H37Rv and 
M bovis AF2 122/97. hideed, PvuQ RFLP analysis of M microti OV254 reveals 13 
1S6110 elements (data not shown). 

Genomic diversity of M microti strains. In order to obtain a more global picture of the 
genetic organization of the taxon M microti we evaluated the presence or absence of the 
variable regions found in strain OV254 in eight other M microti strams. These strains 
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which were isolated from humans and voles have been designated as M microti mainly 
on the basis of their specific spoligotype (26, 32, 43) and can be further divided into 
subgroups according to the host such as voles, llama and humans (Table 3). As stated in 
the introduction,- M microti is rarely found in humans unlike M tuberculosis. So the 

5 availability of 9 strains from variable sources for genetic characterization is an 
exceptional resource. Among them was one strain (Myc 94-2272) from a severely 
immuno-compromised individual (43), and four strains were isolated from HIV-positive 
or HTV-negative humans with spoligotypes typical of llama and mouse isolates. For one 
strain, ATCC 35872 / M.P. Prague, we could not identify with certainty the original host 

10 from which the strain was isolated, nor if this strain corresponds to M microti OV166, 
that was received by Dr. Sula from Dr. Wells and used thereafter for the vaccination 
program in Prague in the 1960's (38). 

First, we were interested if these nine strains designated as M microti on the basis of 
15 their spoligotypes also resembled each other by other molecular typing criteria. As RFLP 
of pulsed-field gel separated chromosomal DNA represents probably the most accurate 
molecular typing strategy for bacterial isolates, we determined the Asel profiles of the 
. available M microti strains, and found that the profiles resembled each other closely but 
differed significantly from the macro-restriction patterns of M tuberculosis, M bovis 
20 and M bovis BCG strains used as controls. However, as depicted in Figure 8 A, the 
patterns were not identical to each other and each M microti strain showed subtle 
differences, suggesting that they were not epidemiologically related. A similar 
observation was made v/ith other rare cutting restriction enzymes, like Z>ral orXbal (data 
not shown). 
25 . 

Common and diverging features of M microti strains. Two strategies were used to 
test for the preserice or absence of variable regions in these strains for which we do not 
have ordered BAC libraries. First, PCRs using internal and flanking primers of the 
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variable regions were employed and amplification products of the junction regions were 
sequenced. Second, probes jfrom the internal portion of variable regions absent from M. 
microti OV254 were obtained by amplification of M, tuberculosis H37Rv DNA using 
specific primers. Hybridization with these radio-labeled probes was carried out on blots 
5 from PFGE separated Asel restriction digests of the M. microti strains. In addition, we 
confirmed the findings obtained by these two techniques by using a focused macro-array, 
containing some of the genes identified in variable regions of the tubercle bacilli to date 
(data not shown). 

10 This led to the finding that the RDl'"*'' deletion is specific for ail M. microti strains 
tested. 

Indeed, none of the M microti DNA-digests hybridized with the radio-labeled esat-C 
probe (Fig. 8B) but with the RDl"''**' flanking region (Fig. 8C). In addition, PGR 
amplification using primers flanking the RDl"^*^ region (Table 2) yielded fragments of 

15 the same size for M microti strains whereas no products were obtained for M 
tuberculosis^ M, bovis and M bovis BCG strains (Fig. 9). Furthermore, the sequence of 
the junction region was found identical among the strains which confirms that the 
genomic organization of the RDl™*^ locus was the same in all tested M microti strains 
(Table 3). This clearly demonstrates that M. microti lacks the conserved ESAT-6 family 

20 core region stretching in other members of the M tuberculosis complex from Rv3864 to 
Rv3876 and, as such, represents a taxon of naturally occurring ESAT-6 / CFP-10 
deletion mutants. 

Like RDl™*^ MiD3 was found to be absent from all nine M microti strains tested and, 
25 therefore, appears to be a specific genetic marker that is restricted to M microti strains 
(Table 3). However, PCR amplification showed that RD5""*^ is absent only from the vole 
isolates OV254, OV216 and OV183, but present in the M microti strains isolated from 
human and other origins (Table 3). This was confirmed by the presence of single bands 
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but of differing sizes on a Southern blot hybridized with a plcA probe for all M microti 
tested strains except OV254 (Fig. SD). Interestingly, the presence or absence of RDS"*'*" 
correlated with the similarity of IS6110 RFLP profiles. The profiles of the three M, 
microti strains isolated from voles m the UK differed considerably from the 1S6110 

5 RFLP patterns of humans isolates (43). Taken together, these results underiine the 
proposed involvement of IS6110 mediated deletion of the RD5 region and further 
suggest that RD5 may be involved in the variable potential of M microti strains to cause 
disease in humans. Similarly, it was found that MiDl was missing only from the vole 
isolates OV254, OV216 and OV183, which display the same spoligotype (43), 

10 confirming the observations that MiDl confers the particular spoligotype of a group of 
M. microti strains isolated from voles. In contrast, PGR analysis revealed that MiDl is 
only partially deleted from strains B3 and B4 both characterized by the mouse 
spoligotype and the human isolate microti Myc 94-2272 (Table 3). For strain ATCC 
35782 deletion of the MiDl region was not observed. These findings correlate with the 

15 described spoligotypes of the different isolates, as strains that had intact or partially 
deleted MiDl regions had more spacers present than the vole isolates that only showed 
spacers 37 and 38. 

2.3 COMMENTS AND DISCUSSION 

20 

We have searched for major genomic variations, due to insertion-deletion events, 
between the vole pathogen, M microti, and the human pathogen, M, tuberculosis. BAG 
based comparative genomics led to the identification of 10 regions absent from the 
genome of the vole bacillus M microti OV254 and . several insertions due to IS6110. 
25 Seven of these deletion regions were also absent from eight other M microti strains, 
isolated from voles or humans, and they account for more than 60 kb of genomic DNA, 
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Of these regions, RDl""^ is of particular interest, because absence of part of this region 
has been found to be restricted to the BCG vaccine strains to date. As M microti was 
originally described as non pathogenic for humans, it is proposed here that RDl genes is 
involved in the pathogenicity for humans. This is reinforced by the fact that RDl^^^ (29) 
5 has lost putative ORFs belonging to the esat-6 gene cluster including the genes encoding 
. ESAT-6 and CFP-10 (Fig. 6) (40). Both polypeptides have been shown to act as potent 
stimulators of the immune system and are antigens recognized during the early stages of 
infection (8, 12, 20, 34). Moreover, tlie biological importance of this RDl region for 
mycobacteria is underlined by the fact that it is also conserved in M leprae, where genes 
10 ML0047-ML0056 show high similarities in their sequence and operon organization to 
the genes in the esat'6 core region of the tubercle bacilli (1 1). In spite of the radical gene 
decay observed in M leprae the esat-6 operon apparently has kept its functionality in 
this organism. 

15 However, the RDl deletion may not be the only reason why the vole bacillus is 
attenuated for humans. Indeed, it remains unclear why certain M microti strains included 
in the present study that show exactly the same RDl"^'*^ deletion as vole isolates, have 
been found as causative agents of human tuberculosis. As human M. microti cases are 
extremely rare, the most plausible explanation for this phenomenon would be that the 

20 infected people were particularly susceptible for mycobacterial infections in general. 
This could have been due to an immunodeficiency (32, 43) or to a rare genetic host 
predisposition such as interferon gamma- or IL-12 receptor modification (22). 

In addition^ the finding that human M, microti isolates differed from vole isolates by the 
25 presence of region RDS"^'*^ may also have an impact on the increased potential of human 
M microti isolates to cause disease. Intriguingly, BCG and the vole bacillus lack 
overlapping portions of this chromosomal region that encompasses three (plcA^ plcB^ 
plcC) of the four genes encoding phospholipase C (PLC) in M tuberculosis. PLC has 
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been recognized as an important virulence factor in numerous bacteria, including 
Clostridium perfringens^ Listeria monocytogenes and Pseudomonas aeruginosa^ where 
it plays a role in cell to cell spread of bacteria, intracellular survival, and cytolysis (36, 
41). To date, the exact role of PLC for the tubercle bacilli remains unclear. encodes 
5 the antigen mtp40 which has previously been shown to be absent from seven tested vole 
and hyrax isolates (28). Phospholipase C activity in M tuberculosis^ M microti and M 
bovis, but not in M bovis BCG, has been reported (21, 47). However, PLC and 
sphingomyelinase activities have been found associated with the most virulent 
mycobacterial species (21). The levels of phospholipase C activity detected in M bovis 

10 were much lower than those seen in M. tuberculosis consistent with the loss of plcABC. 
It is likely, that plcD is responsible for the residual phospholipase C activity in strains 
lacking RD5, such as M. bovis and M microti OV254. Indeed, the plcD gene is located 
in region RvD2 which is present in some but not all tubercle bacilli (13, 18). 
Phospholipase encoding genes have been recognized as hotspots for integration of 

15 IS6110 and it appears that the regions RD5 and RvD2 undergo independent deletion 
processes more frequently than any other genomic regions (44). Thus, the virulence of 
some M microti strains may be due to a combination of functional phospholipase C 
encoding genes (7, 25, 26, 29). 

20 Another intriguing detail revealed by this study is that among the deleted genes seven 
code for members of the PPE family of Gly-, Ala-, Asn-rich proteins. A closer look at 
the sequences of these genes showed that in some cases they were small proteins with 
unique sequences, like for example Rv3873, located in the RDl"*^ region, or Rv2352c 
and Rv2353c located in tlie RD5"'''^ region. Others, like Rv3347c, located in the MiD3 

25 region code for a much larger PPE protein (3157 aa). In this case a neighboring gene 
(Rv3345c), belonging to another multigene family, the PE-PGRS family, was partly 
affected by the MiD3 deletion. While the function of the PE/PPE proteins is currently 
unknown, their predicted abundance in the proteome of A£ tuberculosis suggests that 



wo 03/085098 



PCT/IB03/01789 



51 

they may play an important role in the life cycle of the tubercle bacilli. Indeed, recently 
some of them were shown to be involved in the pathogenicity of M tuberculosis strains 
(9). Complementation of such genomic regions in M microti OV254 should enable us to 
carry out proteomics and virulence studies in animals in order to understand the role of 
5 such ORFs in pathogenesis. 

In conclusion, this study has shown that M microti^ a taxon originally named after Its 
major host Microtus agrestis^ the common vole, represents a relatively homogenous 
group of tubercle bacilli. Although all tested strains showed unique PFGE macro- 

10 restriction patterns that differed slightly among each otfier, deletions that were common 
to all M microti isolates (RD7-RD10, MiD3, RDl™^ have been identified. The 
conserved nature of these deletions suggests that tliese strains are derived from a 
common precursor that has lost these regions, and their loss may account for some of the 
observed common phenotypic properties of M microti, like the very slow growth on 

15 solid media and the formation of tiny colonies. This finding is consistent with results 
from a recent study that showed that M. microti strains carry a particular mutation in the 
gyr5gene(31). 

Of particular interest, some of these common features (e.g. the flanking regions of 
20 RDl"", or MiD3) could be exploited for an easy-to-perform PCR identification test, 
similar to the one proposed for a range of tubercle bacilli (33). This test enables 
unambiguous and rapid identification of M microti isolates in order to obtain a better 
estimate of the overall rate of M microti infections in humans and other mammalian 
species, 
25 ' . 

Example 3: Recombinant BCG exporting ESAT-6 confers enhanced protection 

against tuberculosis 
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3.1 Complementation of the RDl locus of BCG Pasteur and M microti 

To construct a recombinant vaccine tliat secretes both ESAT-6 and CFP-10, we 

5 complemented BCG Pasteur for the RDl region using genomic fragments spanning 
variable sections of the esxBA (or ESAT-6) locus from M tuberculosis (Fig. 10). Tlie 
RDl deletion in BCG interrupts of removes nine CDS and affects all four transcriptional 
units: three are removed entirely while the fourfli (Rv3867-Rv3871) is largely intact 
apart from the loss of 112 codons from the 3'-end of Rv3871 (Fig. 10). Transcriptome 

10 analysis of BCG, performed using cDNA probes obtained from early log phase cultures 
with dligonucleotide-based microarrays, wras able to detect signals at least two fold 
greater than background for the probes corresponding to Rv3867 to 3871 inclusive,- but 
not for the RDl -deleted genes Rv3872 to Rv3879. This suggests that the Rv3 867-3871 
transcriptional unit is still active in BCG which, like M bovis^ also has frameshifts in the 

15 neighbouring gene, Rv3881 (Fig. 10). The RDl'"''' deletion of M microti removes three 
transcriptional units completely with only gene Rv3877 remaining from the fourth. The 
M tuberculosis clinical isolate MT56 has lost genes Rv3878-Rv3879 (Brosch, R., et al 
eyolutionary scenario for the Mycobacterium tuberculosis complex. Proc Natl 
AcadSci USA 99, 3684-9. (2002)) but still secretes ESAT-6 and CFP-10 (Fig. 10). 

20 . 

To test the hypothesis that a dedicated export machinery exists and to establish which 
geiies were essential for creatmg an ESAT-6-CFP-10 secreting vaccine: we assembled a 
series of integrating vectors carrying fragments spanning different portions of the RDl 
esx gene cluster (Fig. 10). These integrating vectors stably insert into the ottB site of the 
25 genome of tubercle bacilli. pAP34 was designed to carry only the antigenic core region 
encoding ESAT-6 and CFP-10, and the upstream PE and PPE genes, whereas RD1-I106 
and RDl-pAP35 were selected to include the core region and either the dovmstream or 
upstream portion of the gene cluster, respectively. The fourth construct RD1-2F9 
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contains a - 32 kb segment from M tuberculosis that stretches from Rv3861 to Rv3885 
covering the entire RDl gene cluster. We adopted this strategy of complementation with 
large genomic fragments to avoid polar effects that might be expected if a putative 
protein complex is only partially complemented in trans. In addition, a set of smaller 
. 5 expression constructs (pAP47, pAP48) was established in which individual genes are 
transcribed from a heat shock promoter (Fig. 10). Using appropriate antibodies all of 
these constructs were found to produce the corresponding proteins after transformation 
of BCG or M microti (see below). 

10 3.2 Several genes of the esx cluster are required for export of ESAT-6 and CFP-10 

The four BCG::RD1 recombinants (BGG::RDl-pAP34, BCG::RDl-pAP35, BCG::RD1- 
2F9 and BCG::RD1-I106) (Fig. 1 1) were initially tested to ensure that ESAT-6 and CFP- 
10 were being appropriately expressed from the respective integrated constructs. 

15 Immunoblotting of whole cell protein extracts from mid-log phase cultures of the 
various BCG::RD1 recombinants using an ESAT-6 monoclonal antibody or polyclonal 
sera for CFP-10 and the PPE68 protein Rv3873 demonstrated that all three proteins were 
expressed from the four constructs at levels comparable to those of M tuberctdosis (Fig. 
1 1). However, striking diiSferences were seen when the supematants from early log-phase 

20 cultures of each recombinant were screened by Western blot for the two antigens. 
Although low levels of ESAT-6 and CFP-10 could be detected in the concentrated 
supernatant protein fractions of BCG::RDl-pAP34, BCG::RDl-pAP35 and BCG::RD1- 
1106 it was only with the integrated construct encompassing the entire esx gene cluster 
(BCG::RD1-2F9) that the two antigens accumulated in significant amounts. The high 

25 concentrations of ESAT-6 and CFP-10 seen in the supernatant of the recombinant 
BCG::RD1-2F9 were not due to a non-specific increase in permeability, or loss of cell 
wall material, because when the same whole cell and supernatant protein fractions were 
immunoblotted with serum raised against Rv3873, this protein was only localized in the 



wo 03/085098 



PCT/IB03/01789 



54 

cell wall of the various recombinants. As expected, when constructs were used 
containing esxA or esxBA alone, ESAT-6 did not accumulate in the culture supernatant 
(data not shown). 
* 

5 To assess the effect of the RDl'"''' deletion of M microti on the export of ESAT-6 and 
CFP-10 and subsequent antigen handling, the experiments were replicated in this 
genomic background. As with BCG, ESAT-6 and CFP-10 were only exported mto the 
supernatant fraction in significant amounts if expressed in conjunction with the entire 
esx cluster (Fig. 1 1). The combined findings demonstrate that complementation with 

10 esxA or esxB alone is insufficient to produce a recombinant vaccine that secretes these 
two antigens. Rather, secretion requires expression of genes located both upstream and 
downstream of the antigenic core region confirming our hypothesis 20 that the consers'ed 
esx gene cluster does indeed encode functions essential for the export of ESAT-6 and 
CFP-10. 

15 . 

3.3 Secretion of ESAT-6 is needed to induce antigen specific T-cell responses 

Since the classical observation that inoculation with live, but not dead BCG, confers 
protection against tuberculosis in animal models it has been considered that secretion of 
antigens is critical for maximizing protective T-cell immunity. Using our panel of 

20 recombinant vaccines we were able to test if antigen secretion was indeed essential for 
eliciting ESAT-6 specific T-cell responses. Groups of C57/BL6 mice were inoculated 
subcutaneously with one of six recombinant vaccines (BCG-pAP47, BCG-pAP48, 
BCG::RDl-pAP34, BCG::RDl-pAP35, BCG::RDI-I106, BCG::RD1-2F9) or with BCG 
transformed with the empty vector pYUB412. Three weeks following vaccination, T-cell 

25 immune responses to the "seven vaccines were assessed by comparing antigen-specific 
splenocyte proliferation and gamma mterferon (EFN-y) production (Fig. 12A). As 
Mticipated all of the vaccines generated splenocyte proliferation and IFN-y production in 
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response to PPD (partially purified protein derivative) but not against an unrelated MalE 
control peptide indicating successful vaccination in each case. However, only 
splenocytes from the mice inoculated with BCGiiRDl-2F9 proliferated markedly in 
response to the immunodominant ESAT-6 peptide (Fig. 12A). Furthemiore, IFN-D was 
5 only detected in culture supematants of splenocytes from mice immunized with 
. BCG::RD1-2F9 following incubation with the ESAT-6 peptide (Fig 12B) or 
recombinant CFP-10 protein (data not shown). These data demonstrate that export of the 
. antigens is essential for stimulating specific Th I -oriented T-cells. 

Further characterization of the immune responses was carried out. Splenocytes from 
10 mice immunized with BCG::RD1-2F9 or control BCG both proliferated in response to 
the immunodominant antigen 85A peptide (Fig 13 A). The strong splenocyte 
proliferation in the presence of ESAT-6 was abolished by an anti-CD4 monoclonal 
antibody but not by anti-CDS indicating that the CD4^ T-cell subset was involved (Fig. 
13B). Interestuigly, as judged by in viti'o BFN-y response to PPD and the ESAT peptide, 
15 subcutaneous immunization generated much stronger T-cell responses (Fig. 13C) 
compared to intravenous injection. After subcutaneous immunisation with BCG::RD1- 
2F9 strong ESAT-6 specific responses were also detected in inguinal lymph nodes (data 
not shown). These experiments demonstrated that the ESAT-6 T-cell immune responses 
to vaccination with BCG::RD1-2F9 were potent, reproducible and robust making this 
20 recombinant an excellent candidate for protection studies. 

3.4 Protective efficacy of BCG: :RD1-2F9 in immuno-competent mice 

When used alone as a subunit or DNA vaccine, ESAT-6 induces levels of protection 
weaker than but akin to those of BCG (Brandt, L., Elhay, M., Rosenkrands, L, Lindblad, EB. & 
Andersen, P. ESAT-6 subunit vaccination against Mycobacterium tuberculosis. Infect Inimiin, 68, 791- 
25 795 (2000)). Thus, it was of interest to determine if the presentation to the immune system 
of ESAT-6 and/or CFP-10 in the context of recombinant BCG, mimicking the 
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presentation of the antigens during natural infection, could increiase the protective 
efficiency of BCG. The BCG::RD1-2F9 recombinant was therefore selected for testing 
as a vaccine, since it was the only ESAT-6 exporting BCG that elicited vigorous antigen 
specific T-cell immune responses. Groups of C57BL/6 mice were inoculated 
5 intravenously with either BCG::RD1-2F9 or BCG::pYUB412 and challenged 
intravenously after eight weeks with M tuberculosis H37Rv. Growth of M iubereidosis 
H37Rv in spleens and lungs of each vaccinated cohort was compared with that of 
un vaccinated cdritrols two months after infection (Fig. 14A). This demonstrated that, 
compared to vaccination with BCG, the BCG::RD1-2F9 vaccine inhibited growth of M 
10 tuberculosis H37Rv in the spleens by 0.4 loglO CFtJ and was of compiarable efficacy at 
protecting the lungs. 

To investigate this enhanced protective ejBFect against tuberculosis further we repeated 
the challenge experiment using the aerosol route. In this experiment antibiotic treatment 
was employed to clear persisting BCG fi-om mouse organs prior to infection with M 

15 tuberculosis. Two months following vaccination C57BL/6 mice were treated with daily 
rifampicin/izoniazid for three weeks and then infected with 1000 CFU of M 
tuberculosis H37Rv by the respiratory route. Mice were then sacrificed after 17, 35 and 
63 days and bacterial enumeration carried out on the lungs and spleen. This 
demonstrated that, even following respiratory infection, vaccination with BCG::RD1- 

20 2F9 was . superior to vaccination with the control strain of BCG (Fig. 14B). However, 
growth of M tuberculosis was again only inhibited strongly in the mouse spleens. 



Example 4; Protective efficacy of BCG::RP1«2F9 in guinea pigs 

4.1 Animal models M tuberculosis H37Rv and the different recombinant vaccines 
25 were prepared in the same manner as for the immunological assays. For the guinea pig 
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assays, groups of outbred female Dunkin-Hartley guinea pigs (David Hall, UK) were 
inoculated with 5 x 10"* CPUs by the subcutaneous route. Aerosol challenge was 
performed 8 weeks after vaccination using a contained Henderson apparatus and an 
H37Rv (NCTC 7416) suspension in order to obtain an estimated retained inhaled dose of 
5 approximately 1000 CFU/lung (Williams, A., Davies, A, Marsh, P.D., Chambers, M.A & 
Hewinson, R.G. Comparison of the protective efficacy of bacille calmette-Guerin vaccination against 
aerosol challenge with Mycobacterium tuberculosis and Mycobacterium bovis. Clin Infect Dis 30 SuppI 3, 
8299-301. (2000)). Organs were homogenized and dilutions plated out on 7H11 agar, as for 
the mice experiments. Guinea pig experiments were carried out in the framework of the 
1 0 European Union TB vaccme development program. 



4.2 Results Although experiments in mice convincingly demonstrated a superior 
protective efficacy of BCG::RD1 over BCG it was important to establish a similar effect 
in the guinea pig model of tuberculosis. Guinea pigs are exquisitely sensitive to 

15 tuberculosis, succumbing rapidly to low dose infection with M tuberculosis^ and 
develop a necrotic granulomatous pathology closer to that of human tuberculosis. 
Immunization of guinea pigs with BCG::RD1-2F9 was therefore compared to 
conventional BCG vaccination. Groups of six guinea pigs were inoculated 
subcutaneously with salme, BCG or BCG::RD1-2F9. Eight weeks following inoculation 

20 the three guinea pig cohorts were challenged with M tuberculosis H37Rv via the aerosol 
route. Individual animals were weighed wieekly and were killed 17 weeks after challenge 
or earlier if they developed signs of severe tuberculosis. Whereas all unvaccinated 
guinea pigs failed to thrive and were euthanised before the last time-point because of 
overwhelming disease, both the BCG- and recombinant BCG::RDl-2F9-vaccinated 

25 animals progressively gained weight and were clinically well when killed on termination 
of the experiment (Fig. ISA). This indicated that although the BCG::RD1-2F9 
recombinant is more virulent in severely immunodeficient mice (Pym, A.S., Brodin, P., 
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Brosch, R., Huerre, M. & Cole, ST. Loss of RDl contributed to the attenuation of the 
live tuberculosis vaccines ^^ycobacterh4m bovis BCG and Ji^^cobacterium microti. MoL 
Microbiol 46, 709-717 (2002)). there is no increased pathogenesis in the highly 
susceptible guinea pig model of tuberculosis. Moreover, when the bacterial loads in the 

5 spleens of the vaccinated animals were compared there was a greater than ten-fold 
reduction in the number of CFU recovered from the animals immunised with 
BCG::RD1-2F9 when compared to BCG (Fig. 15B). Interestingly, there was no 
significant difference between the number of CFU obtained from the lungs of the two 
vaccinatied groups indicating that the organ-specific enhanced protection observed in 

10 mice vaccinated with BCG::RD1-2F9 was also seen with guinea pigs. This marked 
reduction of bacterial loads in the spleens of BCG::RD1-2F9 inununised animals was 
also reflected in the gross pathology. Visual examination of the spleens showed that 
tubercules were much larger and more numerous on the surface of the BCG-vaccinated 
guinea pigs (Fig. 15C). These results demonstrate that the recombinant vaccine 

15 BCG::RD1-2F9 conveys enhanced protection to an aerosol challenge with M 
/Mfeercwteis in two distinct animal models. 

GENERAL CONCLUSION 

20 Tuberculosis is still one of the leading infectious causes of death in the world despite a 
decade of improving delivery of treatment and control strategies (Dye, C, Scheele, S., 
Dolin, P., Pathania, V. & Raviglione, M.C. Consensus statement. Global burden of 
tuberculosis: estimated incidence, prevalence, and mortality by country. WHO Global 
Surveillance and Monitoring Project Jama 282, 677-86. (1999)). Reasons for the 

25 . recalcitrance of this pandemic are multi-factorial but include the modest efficacy of the 
widely used vaccine, BCG. Two broad approaches can be distinguished for the 
development of improved tuberculosis vaccines (Baldwin, S.L., et al Evaluation of new 
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vaccines in the mouse and guinea pig model of tuberculosis. Infection & Immunity 66, 
2951-9 (1998), Kaufmann, S.H. How can immunology contribute to the control of 
tuberculosisDJVb/z/r^ Rev Immunol 1, 20-30. (2001) and Young, D.B. & Fruth, U. in New 
Generation Vaccines (eds. Levine, M., Woodrow, G., Kaper, J. & Cobon GS) 631-645 
5 (Tv^arcel Dekker, 1997)). These are the development of subunit vaccines based on 
purified protein antigens or new live vaccines that stimulate a broader range of immune 
responses. Although a growing list of individual or combination subunit vaccines, and 
hybrid proteins, have been tested none has yet proved superior to BCG in anunal models 
(Baldwin, S.L., et al, 1998). Similarly, new attenuated vaccines derived fi*om virulent 

10 M tuberculosis have yet to out-perform BCG (Jackson, M., et al Persistence and 
protective efficacy of a A^cobacterium tuberculosis auxotroph vaccine. Infect Immun 
67, 2867-73. (1999) and Hondalus, M.K., et al Attenuation of and protection mduced by 
a leucine auxotroph of Ad^cobacterium tuberculosis. Infect Immun 68, 2888-98. (2000)). 
hiterestingly, the only vaccine that appears to surpass BCG is a BCG recombinant over 

15 expressing antigen 85A (Horwitz, M.A., Harth, G., Dillon, BJ. & Maslesa-Galic, S. 
Recombinant bacillus calmette-guerin (BCG) vaccines expressing the Mycobacterium 
tuberculosis 30-kDa major secretory protein induce greater protective immunity against 
tuberculosis than conventional BCG vaccines in a highly susceptible animal model. Froc 
Natl Acad Sci U SA 97, 13853-8. (2000)). The basis for this vaccme was the notion that 

20 over-expression of an immunodominant T-cell antigen could quantitatively enhance the 
BCG-elicited immune response. 

In frame with the invention, we were able to show that restoration of the RDl locus did 
indeed improve the protective efficacy of BCG and defines a genetic modification that 
25 should be included in new recombinant BCG vaccines. Moreover, we were able to 
demonstrate two fiirther findings that will be crucial for tlie development of a live 
vaccine against tuberculosis. First, we have identified the genetic basis of secretion for 
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the ESAT-6 family of immunodominant T-cell antigens, and second, we show that 
export of these antigens from the cytosol is essential for maxunizing their antigenicity. 

The extra-cellular proteins of M tuberculosis have been extensively studied and shown 
5 to be a rich source of protective antigens (Sorensen, A.L., Nagai, S., Hoiien, G., 
Andersen, P. & Andersen, A.B. Purification and characterization of a low-molecular- 
mass T-cell antigen secreted by Mycobacterium tuberculosis. Infect Immim 63, 1710-7 
(1995), Skj0t, R.L.V., et al Comparative evaluation of low-molecular-mass proteins 
from Mycobacterium tuberculosis identifies members of the ESAT-6 family as 

10 immunodominant T-cell antigens, brfect. Immun. 68, 214-220 (2000), Horwitz, M.A., 
Lee, B.W-, Dillon, B.J. & Harth, G. Protective immunity against tuberculosis induced by 
vaccination witli major extracellular proteins of ^dycobacterium tiiberculosis. Proc Natl 
AcadSci USA 92, 1530-4 (1995) and Boesen, H., Jensen, B.N., Wilcke, t, & Andersen, 
P. Human T-cell responses to secreted antigen fractions of Mycobacterium tuberculosis. 

15 Infect Immun 63, 1491-7 (1995)). Despite this it remains a mystery how some of these 
proteins, that lack conventional secretion signals, are exported from the cytosol, a unique 
problem in M tuberculosis given the impermeability and waxy nature of the 
mycobacterial cell envelope. Although two secA orfhologues were identified in the 
genome sequence of M tuberculosis (Cole, S.T-, et al Deciphering the biology of 

20 Mycobacterium tuberculosis from the complete genome sequence. Nature 393, 537-544 
(1998)), no genes for obvious type I, U, or HI protein secretion systems were detected, 
like those that mediate the virulence of many Gram-negative bacterial pathogens (Finlay, 
B.B. & Falkow, S. Common themes in microbial pathogenicity revisited. Microbiol. 
Mol Biol Rev. 61, 136-169 (1997)). This suggested that novel secretion systems might 

25 exist: An in silico analysis of the M. tuberculosis proteome identified a set of proteins 
and genes whose inferred functions, genomic organisation and strict association with the 
esx gene family suggested that they could constitute such a system (Tekaia, F., et al 
Analysis of the proteome of Afycobacterium tuberculosis in silico. Tubercle Lung 
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Disease 79, 329-342 (1999)). Our results provide the first empirical evidence that this 
gene cluster is essential for the normal export of ESAT-6 and CFP-10. 

The antigen genes, esxBA, lie at the centre of the conserved gene cluster. Bioinformatics 
5 and comparative genomics predicted that both the conserved upstream genes Rv3868- 
Rv3871, as well as the downistream genes Rv3876-Rv3877, would be required for 
secretion (Fig. 1) and strong experimental support for this prediction is provided here. 
Our experiments show that only when BCG or M microti are complemented with the 
entire cluster is maximal export of ESAT-6 and CFP-10 obtained. This suggests that at 

10 least Rv3871 and either Rv3876 or Rv3877 are indeed essential for the normal secretion 
of ESAT-6 as these are the only conserved genes absent or disrupted in BCG which are 
not complemented by RD1-I106 or RDl-pAP35. These genes encode a large 
transmembrane protein with ATPase activity, an ATP-dependent chaperone and an 
integral membrane protein, functional predictions compatible with them being part of a 

15 multi-protein complex involved in the translocation of polypeptides. Amongst the 
proteins encoded by the esx cluster Rv3871 and Rv3877 are highly conserved, as 
orthologues have been identified in the more streamlined clusters found in other 
actinomycetes, further supporting their direct role in secretion (Gey Van Pittius, N.C., et 
ah The ESAT-6 gene cluster of Mycobacterium tuberculosis and other high G+C Gram- 

20 positive bacteria. Geitome Biol 2, 44.1-44.18 (2001)). It has been shown recently that 
ESAT-6 and CFP-10 form a heterodimer in vitro (Renshaw, P.S.^ et al Conclusive 
evidence that the major T-cell antigens of the M tuberculosis complex ESAT-6 and 
CFP-10 form a tight, 1:1 complex and characterisation of the structural properties of 
ESAT-6, CFP-10 and the ESAT-6-CFP-10 complex: implications for pathogenesis and 

25 virulence, J Biol Chem 8, 8 (2002)) but it is not known whether dimerisatioil precedes 
translocation across the cell membrane or occurs at a later stage in vivo. In either case, 
chaperone or protein clamp activity is likely to be required to assist dimer formation or 
to prevent premature complexes arising as is well documented for type III secretion 
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systems (Page, A.L. & Parsot, C. Chaperones of the type III secretion pathway: jacks of 
all trades. Mol Microbiol 46, 1-11. (2002)). These, and other questions concerning the 
precise roles of the individual components of the ESAT-6 secretory apparatus, can now 
be addressed experimentally using the tools developed here. 
5 - • : ^ - 

The second major finding of the invention is that the secretion of ESAT-6 (and probably 
CFP-10) is critical for inducing maximal T-cell responses although other RDl -encoded 
proteins may also contribute such as the PPE68 protein (Rv3873) which is located in the 
cell envelope. We show that even tfiough whole cell expression levels of ESAT-6 are 

10 comparable amongst our vaccines (Fig. 2), only the vaccine strain exporting ESAT-6, via 
an intact secretory ajpparatus, elicits powerful T-cell responses. Surprisingly, even the 
recombinants RDl-pAP47 and RDl-pAP48, that overexpress ESAT-6 intracellularly, 
did not generate detectable ESAT-6 specific T-cell responses. Although antigen 
secretion has long been recognized as. important for inducing immunity against M 

15 tuberculosis, and is often used to explain why killed BCG offers no protection, this is 
one of the first fomial demonstrations of its importance. BCG, like M. tuberculosis 
resides in the phagosome, where secreted antigens have ready access to the MHC class II 
antigen processing pathway, essential for inducing IFN-y producing CD4 T-cells 
considered critical for protection against tuberculosis. . Further understanding of the 

20 mechanism of ESAT-6 secretion could allow the development of BCG recombinants that 
deliver other antigens in the same way. 

The main aim of the present invention was to qualitatively enhance the antigenicity of 
BCGi So, having assembled a recombinant vaccine that secreted the T-cell antigens 
25 ESAT-6 and CFP-10, and shown that it elicited powerful Cr)4 T-cell immunity against 
at least ESAT-6 and CFP-10, the next step was to rigorously test its efficacy in animap 
models of tuberculosis. In three distinct models, including two involving respiratory 
challenge, we were able to demonstrate that the ESAT-6-CFP-10 secreting recombinant 
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improved protection when compared to a BCG control, although this effect was 
restricted to the spleen. This is probably due to the fact that the enhanced immunity 
induced by the two additional antigens is insufficient to abort the primary infection but 
does significantly reduce the dissemination of bacteria from the lung. The lack of 

5 protection afforded to the lung, the portal of entry for M tuberculosis, does not prevent 
BCG::RD1-2F9 from being a promising vaccine candidate. Primary tuberculosis occurs 
in the middle and lower lobes and is rarely symptomatic (Garay, S.M. in Tuberculosis 
(eds. Rom, W.N. & Garay, S.MO 373-413 (Little, Brown and Company, Boston, 1996)). 
The bacteria need to reach the upper lobes, the commonest site of disease, by 

10 haematogenous spread. Therefore, a vaccine that inhibits dissemination of M 
tuberculosis from the primary site of infection would probably have major impact on the 
outcome of tuberculosis. 

Recombinant BCG vaccines have definite advantages over other vaccination strategies in 
that they are inexpensive, easy to produce and convenient to store. However, despite an 

15 tmrivalled and enviable safety record concerns remain and BCG is currently not 
administered to individuals with HIV infection. As shown above, the recombinant 
BCG::RD1-2F9 grows more rapidly in Severe Combined Immunodeficient (SCID) mice, 
an extreme model of immunodeficiency, than its parental BCG strain. However, in both 
immunocompetent mice and guinea pigs we have not observed any increased pathology 

20 only a slight increase in persistence which may be beneficial, since the declining efficacy 
of BCG with serial passage has been attributed to an inadvertent increase in its 
attenuation OSehr, M.A- & Small, P.M. Has BCG attenuated to mvpoXoxxo^UNature 389, 
133-4. (1997)). 

Ultimately, the robust enhancement in protection we have observed with the 
25 reincorporation of the RDl locus is a compelling reason to include this genetic 
modification in any recombinant BCG vaccine, even if this may require the need for a 
balancing attenuating mutation. 
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In summary, the data presented here show that, in addition to its increased persistence, 
BCG::RD1-2F9 induces specific T-cell memory and enhances immune responses to 
other endogenous Thl antigens such as the mycoloyi transferase, antigen 85A. 
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